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Modtaget PD 
1 2 J SEP, 1999 

Improved phytases 

Phytases are enzymes that hydrolyze phytate (myo-inositol 
hexakisphosphate) to myo-inositol and inorganic phospha-te and 
5 are known to be valuable feed additives. 

Preferred phytases are myo-inositol hexakisphosphate 
phosphohydrola-ses, such as (myo-inositol hexaphosphate 3- 
phosphohydrolase, EC 3.1.3.8) and (myo-inositol hexaphosphate 6- 
phosphohydrolase, EC 3.1.3.26). 

A phytase was first described in rice bran in 1907 
[Suzuki et al.. Bull. Coll. Agr. Tokio Imp. Univ. 7, 495 (1907)] 
and phytases from Aspergillus species in 1911 [Dox and Golden, 
J. Biol. Chem. 10, 183-186 (1911)]. Phytases have also been 
found in wheat bran, plant seeds, animal intestines and in 
15 microorganisms [Howsen and Davis, Enzyme Microb. Technol. 5, 
377-382 (1983), Lambrechts et al., Biotech. Lett. 14, 61-66 
(1992), Shieh and ware, Appl. Microbiol. 16, 1348-1351 (1968)1. 

The cloning and expression of the phytase from Aspergillus 
niger (ficuum) has been described by Van Hartingsveldt et al., 
20 in Gene, 127, 87-94 (1993) and in European Patent Application, 
Publication NO. (EP) 420 358 and from Aspergillus niger var. 
awamori by Piddington et al., in Gene 133, 55-62 (1993). 

Cloning, expression and purification of phytases with 
improved properties have been disclosed in EP 684 313. However, 
since there is a still ongoing need for further improved 
phytases, especially with respect to their thermostability, it 
is an object of the present invention to provide the following 
process which is, however, not only applicable to phytases. 

The present invention relates to improved phytases, viz. 
30 phytases of amended characteristics, preferably amended activity 
characteristics, amended as compared to the phytase (s) it has 
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been derived from, preferably amended as compared to known 
phytases. Amended activity characteristics means amended in at 
least one phytase activity related respect, such as (non- 
exclusive list): pH stability, temperature stability, pH 
5 profile, temperature profile, specific activity (in particular 
in relation to pH and temperature), substrate specificity, 
substrate cleavage pattern, substrate binding, position 
specificity, the velocity and level of release of phosphate from 
corn, reaction rate, phytate degradation rate), end level of 
10 released phosphate reached. 

Preferred amended activity characteristics are amended 
specific activity, preferably increased, and preferably 
increased at a pH of 3, 4, 5, or 6; amended pH or temperature 
profile; and/or amended, preferably increased, thermostability, 
15 e.g. of an increased melting temperature as measured using DSC. 

The present invention also relates to a process for the 
preparation of a modified protein, wherein in a first step a 
consensus sequence is determined from a number of highly 
homologous sequences according to steps a), b) and c) below: 
20 a) at least three, preferably four amino acid sequences 

are aligned by any standard alignment program known in the art; 

b) amino acids at the same position according to such 
alignment are compared regarding their evolutionary similarity 
by any standard program known in the art, whereas the degree of 
25 similarity provided by such a program which defines the least 
similarity of the amino acids that is used for the determination 
of an amino acid of corresponding positions is set to a less 
stringent number and the parameters are set in such a way that 
it is possible for the program to determine from only 2 
30 identical amino acids at a corresponding position an amino acid 
for the consensus protein; however, if among the compared amino 
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acid sequences are sequences that show a much higher degree of 
similarity to each other than to the residual sequences, these 
sequences are represented by their consensus sequence determined 
as defined in the same way as in the present process for the 
5 consensus sequence of the consensus protein or a vote weight of 
1 divided by the number of such sequences is assigned to every 
of -those sequences; 

c) in case no common amino acid at a defined position 
is identified by the program, any of the amino acids, preferably 
10 the most frequent amino acid of all such sequences is selected; 

in a second step the amino acid sequence of another 
protein which is homologous to the consensus sequence is 
compared with the consensus sequence; and 

in a third step the consensus sequence or the other 
15 protein sequence is modified to define a modified sequence; 

the modified sequence is back-translated into a DNA 
sequence, preferably by using a codon frequency table of the 
organism in which expression should take place; 

the DNA sequence is synthesized by methods known in the 
20 art and used either integrated into a suitable expression vector 
or by itself to transform an appropriate host cell; 

the transformed host cell is grown under suitable culture 
conditions and the other protein is isolated from the host cell 
or its culture medium by methods known in the art. 
25 In one aspect of the above process, a modified other 

protein sequence is defined in the third step as follows: Only 
those amino acid residues .are replaced in the amino acid 
sequence of the other protein which clearly differ from the 
consensus sequence of this protein family calculated under 
30 moderately stringent conditions whereas at all positions of the 
alignment where no preferred single amino acid can be determined 
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under moderately stringent conditions the amino acids of the 
other protein remain unchanged. 

In another aspect, the second step comprises determining 
the active center of the protein, comprising all amino acid 
5 residues that are involved in forming the active center, in the 
consensus sequence, and in the sequence of the other protein as 
well; and in the third step a modified consensus sequence is 
defined as follows: Some or all of the amino acids that form the 
active center of the other protein are inserted in the backbone 
10 of the consensus sequence. 

in a preferred process, the program used for the 
comparison of amino acids at a defined position regarding their 
evolutionary similarity is the program "PRETTY". 

Preferably, the active center of the protein is determined 
15 by using an analysis of the three-dimensional structure of the 
protein . 

A preferred homologeous protein is an enzyme, a preferred 
defined protein family is the family of phytases, preferably of 
fungal origin- 

preferably the amino acid sequence of the phytase is 
changed by the introduction of at least one mutation selected 
from the group consisting of 



20 



25 



30 



E58A 


F54Y 


D69K 


I73V 


D197N 


K94A 


T214L 


RIOIA 


E222T 


N153K 


E267D 


V158I 


R291I 


A203G 


R329H 


S205G 


S364T 


V217A 
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A379K A227V 

G404A V234L ^ 

P238A 
Q277E 

5 A287H 

A292Q 
V366I 
A396S 
E415Q 

10 G437A 

E451R 

whereby the number represents the position in the 
consensus phytase sequence or a corresponding residue according 
-to an alignment as shown in Fig. 1 when 26 amino acids (signal 
15 sequence) are added to the sequences shown in Fig. 1 and the 
letter before the number represents the amino acid in the 
phytase which is replaced by the amino acid behind the number. 

Preferred host cells are of eukaryotic origin, preferably 
fungal, such as Aspergillus, or yeast, preferably Saccharomyces 

20 or Hansenula. 

In another aspect, the invention relates to a modified 
protein obtainable preferably obtained by any of the above 
described processes. 

In a further aspect, the invention relates to a mutein of 
25 the consensus phytase-1, characterized therein that in the amino 
acid sequence of Figure 2 the following replacements have been 
effected Q50L, Q50T, Q50G, Q50T-Y51N, Q50L-Y51N or Q50T-K91A. 

In a preferred embodiment of this process step b) can 
also be defined as follows: 
30 b) amino acids at the same position according to such an 

alignment are compared regarding their evolutionary similarity 
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by any standard program known in the art, whereas the degree of 
similarity provided by such program is set at the lowest 
possible value and the amino acid which is the most similar for 
at least half of the sequences used for the comparison is 
5 selected for the corresponding position in the amino acid 
sequence of the consensus protein. 

By using the above processes the consensus sequence 
derived from a number of highly homologous sequences can be used 
in order to replace only certain amino acid residues in the 

10 protein in such a manner that only those amino acid residues are 
replaced which clearly and unambiguously differ from the 
corresponding consensus sequence of this protein family which 
has been calculated on moderately stringent conditions- At all 
other positions of the alignment, however, where the method of 

15 the present invention is not able to determine clearly a 
preferred amino acid residue under moderately stringent 
conditions the amino acid residues of the other protein are 
maintained unchanged . 

In the alternative, a consensus sequence is determined 

20 from homologous sequences as described above. In a second step 
the active center of the protein comprising all amino acid 
residues that are involved in forming the active center is 
determined in the consensus sequence and in the sequence of a 
single homologous protein as well. The single homologous protein 

25 may have preferred properties like high specific activity or 
different pH dependency of enzymatic activity. In a third step 
some or all amino acid residues that are involved in forming the 
active centre of the homologeous protein are inserted into the 
backbone of the consensus sequence. The result thereof is a 

30 chimeric protein having the active centre derived from a single 
protein and the backbone of the consensus sequence. 
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The active centre of the protein can be determined e.g. by 
using any analysis of the three-dimensional structure of the 
protein, e.g. by homology modelling on the basis of a known 3D- 
structure of a known protein. Frequently the single homologeous 
5 protein is an enzyme. 

It is also an object of the present invention to provide a 
consensus protein obtainable preferably obtained, by such 
processes and specifically the consensus protein, which has the 
amino acid sequences shown in Figures 2, 4 and 6 or a variant 

10 thereof. A "variant" refers in the context of the present 
invention to a consensus protein with amino acid sequence shown 
in Figure 2, 5, 7, and 8 wherein at one or more positions amino 
acids have been deleted, added or replaced by one or more other 
amino acids with the proviso that the resulting sequence 

15 provides for a protein whose basic properties like enzymatic 
activity (type of and specific activity) , thermostability, 
activity in a certain pH-range (pH-stability ) have not 
significantly been changed. "Significantly" means in this 
context that a man skilled in the art would say that the 

20 properties of the variant may still be different but would not 
be unobvious over the ones of the consensus protein with the 
amino acid sequence of Figure 2 itself. 

A "mute in" refers in the context of the present invention 
to replacements of the amino acid in the amino acid sequences of 

25 the consensus proteins shown in Figure 2 which lead to consensus 
proteins with further improved properties e.g. activity. Such 
muteins can be defined and prepared on the basis of the 
teachings given in European Patent Application number 
97810175.6, e. g. Q50L, Q50T, Q50G, Q50L-Y51N, or Q50T-Y51N. 

30 "Q50L" means in this context that at position 50 of the amino 
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acid sequence (Figure 2) the amino acid Q has been replaced by 
amino acid L. 

In addition, a food, feed or pharmaceutical composition 
comprising a consensus protein as defined above is also an 
5 object of the present invention. 

In this context "at least three preferably four amino acid 
sequences of such defined protein family" means that three, 
four, five, six to 12, 20, 50 or even more sequences can be used 
for the alignment and the comparison to create the . amino acid 

10 sequence of the consensus protein- "Sequences of a defined 
protein family" means that such sequences fold into a three 
dimensional structure, wherein the alpha-helixes, the beta- 
sheets and beta-turns are at the same position so that such 
structures are, as called by the man skilled in the art, largely 

15 super imposable. Furthermore these sequences characterize 
proteins which show the same type of biological activity, e.g. a 
defined enzyme class, e.g. the phytases. As known in the art, 
the three dimensional structure of one of such sequences is 
sufficient to allow the modelling of the structure of the other 

20 sequences of such a family. An example, how this can be 
effected, is given in the Reference Example of the present case, 
"Evolutionary similarity" in the context of the present 
invention refers to a scheme which classifies amino acids 
regarding their structural similarity which allows that one 

25 amino acid can be replaced by another amino acid with a minimal 
influence on the overall structure, as this is done e.g. by 
programs, like "PRETTY", known in the art. The phrase "the 
degree of similarity provided by such a program.. -is set to less 
stringent number" means in the context of the present invention 

30 that values for the parameters which determine the degree of 
similarity in the program used in the practice of the present 
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invention are chosen in a way to allow the program to define a 
common amino acid for a maximum of positions of the whole amino 
acid sequence, e. g. in case of the program PRETTY a value of 2 
or 3 for the THRESHOLD and a value of 2 for the PLURALITY can be 
5 choosen. Furthermore, "a vote weight of one divided by the 
number of such sequences" means in the context of the present 
invention that the sequences which define a group of sequences 
with a higher degree of similarity as the other sequences used 
for the determination of the consensus sequence only contribute 

10 to such determination with a factor which is equal to one 
divided by a number of all sequences of this group. 

As mentioned before should the program not allow to select 
the most similar amino acid, the most frequent amino acid is 
selected, should the latter be impossible the man skilled in the 

15 art will select an amino acid from all the sequences used for 
the comparison which is known in the art for its property to 
improve the thermostability in proteins as discussed e.g. by 
Janecek, S. (1993), Process Biochem. 28, 435-445 or Fersht, A. 
R, & Serrano, L. (1993), Curr. Opin. Struct, Biol. 3, 75-83. 

20 Alber, T. (1989), Annu. Rev, Biochem- 58, 765-798 or Matthews, 
B. W. (1987) , Biochemistry 26, 6885-68 88 - Matthews, B. W. 
(1991), Curr- Opin, Struct, Biol, 1, 17-21. 

The stability of an enzyme is a critical factor for many 
industrial applications . Therefore, a lot of attempts, more or 

25 less successful, have been made to improve the stability, 
preferably the thermostability of enzymes by rational {van den 
Burg et al., 1998) or irrational approaches (Akanuma et al,, 
1998). The forces influencing the thermostability of a protein 
are the same as those that are responsible for the proper 

30 folding of a peptide strand (hydrophobic interactions, van der 
Waals interactions, H-bonds, salt bridges, conformational strain 
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(Matthews, 1993). Furthermore, as shown by Matthews et al. 
(1987), the free energy of the unfolded state has also an 
influence on the stability of a protein. Enhancing of protein 
stability means to increase the number and strength of favorable 
5 interactions and to decrease the number and strength of 
unfavorable interactions. It has been possible to introduce 
disulfide linkages (Sauer et al, 1986) to replace glycine with 
alanine residues or to increase the proline content in order to 
reduce the free energy of the unfolded state (Margarit et al, 

10 1992; Matthews, 1987a) . Other groups concentrated on the 
importance of additional H-bonds or salt bridges for the 
stability of a protein (Blaber et al, 1993) or tried to fill 
cavities in the protein interior to increase the buried 
hydrophobic surface area and the van der Waals interactions 

15 (Karpusas et al, 19898) . Furthermore, the stabilization of 
secondary structure elements, especially a-helices, for example, 
by improved helix capping, was also investigated (Munoz & 
Serrano, 1995) . 

However, there is no fast and promising strategy to 

20 identify amino acid replacements which will increase the 
stability, preferably the thermal stability of a protein. 
Commonly, the 3D structure of a protein is required to find 
locations in the molecule where an amino acid replacement 
possibly will stabilize the protein's folded state. Alternative 

25 ways to circumvent this problem are either to search for a 
homologous protein in a thermo- or hyperthermophile organism or 
to detect stability-increasing amino acid replacements by a 
random mutagenesis approach. This latter possibility succeeds in 
only 103 to 104 mutations and is restricted to enzymes for which 

30 a fast screening procedure is available (Arase et al, 1993; 
Risse et al, 1992) . For all these approaches, success was 
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variable and unpredictable and, if successful, the 
thermostability enhancements nearly always were rather small. 

Here we present an alternative way to improve the 
thermostability of a protein. Imanaka et al (1986) were among 
5 the first to use the comparisons of homologous proteins to 
enhance the stability of a protein. They used a comparison of 
proteases from thermophilic with homologous ones of mesophilic 
organisms to enhance the stability of a mesophilic protease. 
Serrano et al (1993) used the comparison of the amino acid 

10 sequences of two homologous mesophilic RNases to construct a 
more thermostable Rnase. They mutated individually all of the 
residues that differ between the two and combined the mutations 
that increase the stability in a multiple mutant. Pantoliano et 
al (1989) and, in particular, Steipe et al (1994) suggested that 

15 the most frequent amino acid at every position of an alignment 
of homologous proteins contribute to the largest amount to the 
stability of a protein. Steipe et al (1994) proved this for a 
variable domain of an immunoglobulin, whereas Pantoliano et al 
(1989) looked for positions in the primary sequence of 

20 subtilisin in which the sequence of the enzyme chosen to be 
improved for higher stability was singularly divergent. Their 
approach resulted in the replacement M50F which increased the Tm 
of subtilisin by 1.8 *C- 

Steipe et al. (1994) proved on a variable domain of 

25 immunoglobulin that it is possible to predict a stabilizing 
mutation with better than 60% success rate just by using a 
statistical method which determines the most frequent amino acid 
residue at a certain position of this domain- It was also 
suggested that this method would provide useful results not only 

30 for stabilization of variable domains of antibodies but also for 
domains of other proteins. However, it was never mentioned that 
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this method could be extended to the entire protein. 
Furthermore, nothing is said about the program which was used to 
calculate the frequency of amino acid residues at a distinct 
position or whether scoring matrices were used as in the present 
5 case. 

It is therefore an object of the present invention to 
provide a process for the preparation of a consensus protein 
comprising a process to calculate an amino acid residue for 
nearly all positions of a so-called consensus protein and to 
10 synthesize a complete gene from this sequence that could be 
expressed in a pro- or eukaryotic expression system. 

DNA sequences of the present invention can be constructed 
starting from genomic or cDNA sequences coding for proteins, 
e.g. phytases known in the art [for sequence information see 
15 references mentioned above, e.g. 

EP 684 313 or sequence data bases, for example like 
Genbank (Intelligenetics, California, USA) , European 
Bioinformatics Institute (Hinston Hall, Cambridge, GB) , NBRF 
(Georgetown University, Medical Centre, Washington DC, USA) and 
20 Vecbase (University of Wisconsin, Biotechnology Centre, Madison, 
Wisconsin, USA) or disclosed in the figures by methods of in 
vitro mutagenesis [see e.g. Sambrook et al.. Molecular Cloning, 
Cold Spring Harbor Laboratory Press, New York] . A widely used 
strategy for "site directed mutagenesis", as originally outlined 
25 by Hurchinson and Edgell [J. Virol. 8, 181 (1971)], involves the 
annealing of a synthetic oligonucleotide carrying the desired 
nucleotide substitution to a target region of a single-stranded 
DNA sequence wherein the mutation should be introduced [for 
review see Smith, Annu, Rev. Genet. 19, 423 (1985) and for 
30 improved methods see references 2-6 in Stanssen et al., Nucl . 
Acid Res,, 17, 4441-4454 (1989)]- Another possibility of 
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mutating a given DNA sequence which is also preferred for the 
practice of the present invention is the mutagenesis by using 
the polymerase chain reaction (PGR) , DNA as starting material 
can be isolated by methods known in the art and described e.g. 
5 in Sambrook et al. (Molecular Cloning) from the respective 
strains. For strain information see^ e.g. EP 684 313 or any 
depository authority indicated below. Aspergillus niger [ATCC 
9142], Myceliophthora thermophila [ATCC 48102], Talaromyces 
thermophilus [ATCC 20186] and Aspergillus fumigatus [ATCC 34625] 

10 have been redeposited according to the conditions of the 
Budapest Treaty at the 7\merican Type Culture Cell Collection 
under the following accession numbers: ATCC 74337, ATCC 74340, 
ATCC 74338 and ATCC 74339, respectively. It is however, 
understood that DNA encoding a consensus protein in accordance 

15 with the present invention can also be prepared in a synthetic 
manner as described, e.g. in EP 747 483 or the examples by 
methods known in the art. 

The process of the present invention can preferably be 
used in order to improve the thermostability of the enzyme 

20 phytase. After having constructed different consensus phytase 
sequences it was possible to decide whether single amino acid 
replacements had a positive or a negative effect on the protein 
stability. It is therefore another subject of the present 
invention to improve the thermostability of a phytase . 

25 In this embodiment single amino acids are changed in the 

sequence of the phytase by the introduction of at least one 
mutation selected from the group consisting of 
E58A F54Y 
D69K I73V 

30 D197N K94A 

T214L RlOlA 
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E222T N153K 
E267D V158I 
R291I A203G 
R329H S205G 
5 S364T V217A 

A379K A227V 
G404A V234L 
P238A 
Q277E 

10 A287H 

A2 92Q 
V366I 
A396S 
E415Q 

15 G437A 

E451R 



In the above-given mutations the number represents the 
position in the consensus phytase-l-sequence as given in Figure 

20 2 and the letter before the number represents the amino acid in 
the phytase which is replaced by the respective amino acid 
behind the number. The numbers given correspond to the consensus 
phytase sequence or relate to a corresponding residue calculated 
by an alignment as shown in Figure 1 when 26 amino acids (signal 

25 sequence) are added to the sequences shown in Fig. 1. Those 
mutations can be introduced into consensus sequences or into 
sequences of specific enzymes which have been improved by a 
process of the present invention. The above-mentioned amino acid 
replacements have a positive effect on the protein stability. 

30 Once complete DNA sequences of the present invention have 

been obtained they can be integrated into vectors by methods 
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known in the art and described e.g. in Sambrook et al. (s.a.) to 
overexpress the encoded polypeptide in appropriate host systems. 
However, a man skilled in the art knows that also the DNA 
sequences themselves can be used to transform the suitable host 
5 systems of the invention to get overexpression of the encoded 
polypeptide. Appropriate host systems are for example fungi, 
like Aspergilli, e.g. Aspergillus niger [ATCC 9142] or 
Aspergillus ficuum [NRRL 3135] or like Trichoderma, e.g. 
Trichoderma reesei or yeasts, like Saccharomyces, e.g. 

10 Saccharomyces cerevisiae or Pichia, like Pichia pastoris, or 
Hansenula polymorpha, e.g, H. polymorpha {DSM5215) or plants, as 
described, e.g. by Pen et al., Bio/Technology 11, 811-814 
(1994). A man skilled in the art knows that such microorganisms 
are available from depository authorities, e.g. the American 

15 Type Culture Collection (ATCC) , the Centraalbureau voor 
Schimmelcultures (CBS) or the Deutsche Sammlung fur 
Milcroorganismen und Zellkulturen GmbH (DSM) or any other 
depository authority as listed in the Journal "Industrial 
Property" [(1991) 1, pages 29-40]. Bacteria which can be used 

20 are e.g. E. coli. Bacilli as, e.g. Bacillus subtilis or 
Streptomyces, e.g. Streptomyces lividans (see e.g. Ann6 and 
Mallaert in FEMS Microbiol. Letters 114, 121 (1993). E. coli, 
which could be used are E. coli K12 strains e.g. M15 [described 
as DZ 291 by Villarejo et al. in J. Bacteriol. 120, 466-474 

25 (1974)], HB 101 [ATCC No. 33694] or E. coli SG13009 [Gottesman 
et al., J. Bacteriol. 148, 265-273 (1981)], 

Vectors which can be used for expression in fungi are 
known in the art and described e.g. in EP 420 358, or by Cullen 
et al. [Bio/Technology 5, 369-376 (1987)] or Ward in Molecular 

30 Industrial Mycology, Systems and Applications for Filamentous 
Fungi, Marcel Dekker, New York (1991), Upshall et al . 
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[Bio/Technology 5, 1301-1304 (1987)] Gwynne et al. 
[Bio/Technology 5, 71-79 (1987)], Punt et al. [J. Biotechnol. 
17, 19-34 (1991)] and for yeast by Sreekrishna et al . [J. Basic 
Microbiol. 28, 265-278 (1988), Biochemistry 28, 4117-4125 
5 (1989)], Hitzemann et al. [Nature 293, 717-722 (1981)] or in 

EP 183 070, EP 183 071, EP 248 227, EP 263 311. Suitable 
vectors which can be used for expression in E. coli are 
mentioned, e.g. by Sambrook et al. [s.a.] or by Fiers et al. in 
Procd. 8th Int. Biotechnology Symposium" [Soc. Franc. de 
10 Microbiol., Paris (Durand et al., eds.), pp. 680-697 (1988)] or 
by Bujard et al. in Methods in Enzymology, eds. Wu and 
Grossmann, Academic Press, Inc. Vol. 155, 416-433 (1987) and 
StUber et al. in Immunological Methods, eds. Lefkovits and 
Pernis, Academic Press, Inc., Vol. IV, 121-152 (1990). Vectors 
15 which could be used for expression in Bacilli are known in the 
art and described, e.g. in EP 405 370, Procd. Natl. Acad. Sci. 
USA 81, 439 (1984) by Yansura and Henner, Meth. Enzymol. 185, 
199-228 (1990) or EP 207 459. Vectors which can be used for the 
expression in H. Polymorpha are known in the art and described, 
20 e.g. in Gellissen et al.. Biotechnology 9, 291-295 (1991). 

Either such vectors already carry regulatory elements, 
e.g. promoters, or the DNA sequences of the present invention 
can be engineered to contain .such elements . Suitable promoter 
elements which can be used are known in the art and are, e.g. 
25 for Trichoderma reesei the cbhl- [Haarki et al.. Biotechnology 
7, 596-600 (1989)] or the pkil-promotor [Schindler et al.. Gene 
130, 271-275 (1993)], for Aspergillus oryzae the amy-promotor 
[Christensen et al., Abstr, 19th Lunteren Lectures on Molecular 
Genetics F23 (1987), Christensen et al.. Biotechnology 6, 1419- 
30 1422 (1988), Tada et al., Mol. Gen. Genet. 229, 301 (1991)], for 
Aspergillus niger the glaA- [Cullen et al., Bio/Technology 5, 
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369-376 (1987), Gwynne et al., Bio/Technology 5, 713-719 (1987), 
Ward in Molecular Industrial Mycology, Systems and Applications 
for Filamentous Fungi, Marcel Dekker, New York, 83-106 (1991)], 
alcA- [Gwynne et al., Bio/Technology 5, 718-719 (1987)], sucl- 
5 [Boddy et al., Curr. Genet. 24, 60-66 (1993)], aphA- [MacRae et 
al.. Gene 71, 339-348 (1988), MacRae et al.. Gene 132, 193-198 
(1993)], tpiA- [McKnight et al.. Cell 46, 143-147 (1986), 
Upshall et al., Bio/Technology 5, 1301-1304 (1987)], gpdA- [Punt 
et al.. Gene 69, 49-57 (1988), Punt et al., J. Biotechnol . 17, 

10 19-37 (1991)] and the pkiA-promotor [de Graaff et al., Curr. 
Genet. 22, 21-27 (1992)]. Suitable promotor elements which could 
be used for expression in yeast are known in the art and are, 
e.g. the pho5-promotor [Vogel et al., Mol . Cell. Biol., 2050- 
2057 (1989); Rudolf and Hinnen, Proc. Natl. Acad. Sci. 84, 1340- 

15 1344 (1987)] or the gap-promotor for expression in Saccharomyces 
cerevisiae and for Pichia pastoris, e.g. the aoxl-promotor 
[Koutz et al.. Yeast 5, 167-177 (1989); Sreekrishna et al., J. 
Basic Microbiol. 28, 265-278 (1988)], or the FMD promoter 
[Hollenberg et al., EPA No. 0299108] or MOX-promotor [Ledeboer 

20 et al.. Nucleic Acids Res. 13, 3063-3082 (1985)] for H. 
polymorpha . 

Accordingly vectors comprising DNA sequences of the 
present invention, preferably for the expression of said DNA 
sequences in bacteria or a fungal or a yeast host and such 

25 transformed bacteria or fungal or yeast hosts are also an object 
of the present invention. 

It is also an object of the present invention to provide a 
system which allows for high expression of proteins, preferably 
phytases like the consensus phytase of the present invention in 

30 Hansenula characterized therein that the codons of the encoding 
DNA sequence of such a protein have been selected on the basis 
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of a codon frequency table of the organism used for expression, 
e.g. yeast as in the present case (see e.g. in Example 3) and 
optionally the codons for the signal sequence have been selected 
in a manner as described for the specific case in Example 3 . 
5 That means that a codon frequency table is prepared on the basis 
of the codons used in the DNA sequences which encode the amino 
acid sequences of the defined protein family. Then the codons 
for the design of the DNA sequence of the signal sequence are 
selected from a codon frequency table of the host cell used for 

10 expression whereby always codons of comparable frequency in both 
tables are used. 

Once such DNA sequences have been expressed in an 
appropriate host cell in a suitable medium the encoded protein 
can be isolated either from the medium in the case the protein 

15 is secreted into the medium or from the host organism in case 
such protein is present intracellularly by methods known in the 
art of protein purification or described in case of a phytase, 
e.g. in EP 420 358. Accordingly a process for the preparation of 
a polypeptide of the present invention characterized in that 

20 transformed bacteria or a host cell as described above is 
cultured under suitable culture conditions and the polypeptide 
is recovered therefrom and a polypeptide when produced by such a 
process or a polypeptide encoded by a DNA sequence of the 
present invention are also an object of the present invention. 

25 Once obtained the polypeptides of the present invention 

can be characterized regarding their properties which make them 
useful in agriculture any assay known in the art and described 
e.g. by Simons et al. [Br, J. Nutr. 64, 525-540 (1990)], Schoner 
et al. [J. Anim. Physiol, a. Anim. Nutr. 66, 248-255 (1991)], 

30 Vogt [Arch. Geflugelk. 56, 93-98 (1992)], Jongbloed et al. [J. 
Anim. Sci . , 70, 1159-1168 (1992)], Perney et al. [Poultry Sci. 
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72, 2106-2114 (1993)], Farrell et al,, [J. Anim. Physiol, a. 
Anim. Nutr, 69, 278-283 (1993), Broz et al., [Br. Poultry Sci. 
35, 273-280 (1994)] and Dungelhoef et al. [Animal Feed Sci. 
Technol- 49, 1-10 (1994)] can be used. 
5 In general the polypeptides of the present invention can 

be used without being limited to a specific field of 
application, e.g. in case of phytases for the conversion of 
inositol polyphosphates, like phytate to inositol and inorganic 
phosphate . 

10 Furthermore the polypeptides of the present invention can 

be used in a process for the preparation of a pharmaceutical 
composition or compound food or feeds wherein the components of 
such a composition are mixed with one or more polypeptides of 
the present invention. Accordingly compound food or feeds or 

15 pharmaceutical compositions comprising one or more polypeptides 
of the present invention are also an object of the present 
invention. A man skilled in the art is familiar with their 
process of preparation. Such pharmaceutical compositions or 
compound foods or feeds can further comprise additives or 

20 components generally used for such purpose and known in the 
state of the art. 

It is furthermore an object of the present invention to 
provide a process for the reduction of levels of phytate in 
animal manure characterized in that an animal is fed such a feed 

25 composition in an amount effective in converting phytate 
contained in the feedstuff to inositol and inorganic phosphate. 

In the present context, a phytase is an enzyme or 
polypeptide that has phytase activity. The phytase is preferably 
purified, viz. at least 85%, preferably at least 86, 87, 88, 89, 

30 90, 91, 92, 93, 94, 95, 96, 97, 98, 99% pure. The phytase is 
preferably isolated. Phytase activity can be determined using 
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any phytase assay known in the art. A preferred assay is the so- 
called standard assay herein (see Example 9) . A preferred assay 
temperature is the optimum temperature of the actual phytase, 
and a preferred assay pH is the optimum pH of the actual 
5 phytase. A preferred assay is described in Example 9 herein, 
Another preferred assay is the FYT assay of example 15 of WO 
98/28409, hereby incorporated by reference. 

In preferred embodiments, the assay temperature is 
selected within the range of 20-90*^0, more preferably 30-80^C, 

10 still more preferably 35-75°C. Preferred assay temperatures are 
37°C, 50*^0, 60°C, and 70^C. 

In further preferred embodiments, the assay pH is selected 
within the range of pH 2-9, more preferably 3-8, still more 
preferably 3-6. Preferred assay pH values are 3, 4, 5, 6 and 7. 

15 Amino acid sequence homology (or polypeptide or amLno acid 

homology) is determined as the degree of identity between two 
sequences. This may suitably be determined by means of computer 
programs known in the art such as GAP provided in the GCG 
program package (Program Manual for the Wisconsin Package, 

20 Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, 
USA 53711), see also Needleman, S.B. and Wunsch, CD., (1970), 
Journal of Molecular Biology, 48, 443-453, hereby incorporated 
by reference. In release 9,1, for comparing polypeptide 
sequences, the Length Weight is set to 0, and the Gap Weight is 

25 set to 3.0. 

The degree of identity or homology between two DNA 
(nucleic acid) sequences may be determined by means of computer 
programs known in the art such as GAP provided in the GCG 
program package (Program Manual for the Wisconsin Package, 
30 Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, 
USA 53711), see also Needleman, S.B. and Wunsch, CD., (1970), 
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Journal of Molecular Biology, 48, 443-453, hereby incorporated 
by reference. In release 9.1, GAP is used with the following 
settings for DNA sequence comparison: GAP creation penalty of 50 
and GAP extension penalty of 3. 
5 Suitable experimental conditions for determining whether a 

given DNA or RNA sequence hybridizes to a specified nucleotide 
or oligonucleotide probe involves presoaking of the filter 
containing the DNA or RNA fragments to examine for hybridization 
in 5 X SSC (Sodium chloride/Sodium citrate), (J. Sambrook, E.F. 

10 Fritsch, and T. Maniatis, 1989, Molecular Cloning, A Laboratory 
Manual, 2d edition. Cold Spring Harbor, New York) for 10 min, 
and prehybridization of the filter in a solution of 5 x SSC, 5 x 
Denhardt's solution (Sambrook et al. 1989), 0.5 % SDS and 100 
]ag/ml of denatured sonicated salmon sperm DNA (Sambrook et al. 

15 1989), followed by hybridization in the same solution containing 
a concentration of 10 ng/ml of a random-primed (Feinberg, A. P. 
and Vogelstein, B. (1983) Anal. Biochem. 132:6-13), 32P-dCTP- 
labeled (specific activity > 1 x 109 cpm/pg) probe for 12 hours 
at approximately 45°C. 

20 The filter is then washed twice for 30 minutes in 2 x SSC, 

0.5 % SDS at at least 55**C (low stringency), at at least 60°C 
(medium stringency) , at at least 65*'C (medium/high stringency) , 
at at least 70°C (high stringency), or at at least 15°C (very 
high stringency) . 

25 Molecules to which the oligonucleotide probe hybridizes 

under these conditions are detected using an x-ray film. 

Before describing the present invention in more detail a 
short explanation of the Figures enclosed is given below. 
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Figure 1 : Design of the consensus phytase sequence. The 
letters represent the amino acid residues in the one-letter 
code. The following sequences were used for the alignment: phyA 
from Aspergillus terreus 9A-1 (Mitchell et al, 1997; from amino 
5 acid (aa) 21), phyA from A. terreus cbsll6.46; (van Loon et al., 
1998; from aa 21), phyA from Aspergillus niger var, awamori 
(Piddington et al, 1993; from aa 21), phyA from A. niger T213; 
from aa 27*) , phyA from A. niger strain NRRL3135 (van 
Hartingsveldt et al, 1993; from aa 27), phyA from Aspergillus 

10 fumigatus ATCC 13073 (Pasamontes et al, 1993; from aa 25), phyA 
from A- fumigatus ATCC 32722 (van Loon et al, 1998; from aa 27), 
phyA from A. fumigatus ATCC 58128 (van Loon et al., 1998; from 
aa 27), phyA from A. fumigatus ATCC 26906 (van Loon et al, 1998; 
from aa 27), phyA from A, fumigatus ATCC 32239 (van Loon et al, 

15 1998; from aa 30), phyA from Emericella nidulans (Pasamontes et 
al, 1997a; from aa 25), phyA from Talaromyces thermophilus 
(Pasamontes et al, 1997a; from aa 24), and phyA from 
Myceliophthora thermophila (Mitchell et al, 1997; from aa 19) , 
The alignment was calculated using the program PILEUP. The 

20 location of the gaps was refined by hand. Capitalized amino acid 
residues in the alignment at a given position belong to the 
amino acid coalition that establish the consensus residue. In 
bold, beneath the calculated consensus sequence, the amino acid 
sequence of the finally constructed consensus phytase (Fcp) is 

25 shown. The gaps in the calculated consensus sequence were filled 
by hand according to principals stated in Example 1. 

Figure 2 : DNA sequence of the consensus phytase-1 gene 
(fcp) and of the primers used for the gene construction- The 
30 calculated amino acid sequence (Figure 1) was converted into a 
DNA sequence using the program BACKTRANSLATE (Devereux et al.. 
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1984) and the codon frequency table of highly expressed yeast 
genes (GCG program package, 9.0). The signal peptide of the 
phytase from A. terreus cbs. 116.46 was fused to the N-terminus. 
The bold bases represent the sequences of the oligonucleotides 
5 used to generate the gene. The names of the respective 
oligonucleotides are alternately noted above or below the 
sequence. The underlined bases represent the start and stop 
codon of the gene. The bases written in italics show the two 
introduced Eco RI sites. 

10 

Figure 3 : Alignment and consensus sequence of five 
Basidiomycetes phytases. The letters represent the amino acid 
residues in the one-letter code. The amino acid sequences of the 
phytases from Paxillus involutus, phyAl (aa 21) and phyA2 (aa 

15 21, WO 98/28409), Trametes pubescens (aa 24, WO 98/28409), 
Agrocybe pediades (aa 19, WO 98/28409), and Peniophora lycii (aa 
21, WO 98/28409) starting with the amino acid residues mentioned 
in parentheses, were used for the alignment and the calculation 
of the corresponding consensus sequence called "Basidio" 

20 (Example 2) . The alignment was performed by the program PILEPUP. 
The location of the gaps was refined by hand. The consensus 
sequence was calculated by the program PRETTY. While a vote 
weight of 0.5 was assigned to the two P. involutus phytases, all 
other genes were used with a vote weight of 1.0 for the 

25 consensus sequence calculation. At positions, where the program 
was not able to determine a consensus residues, the Basidio 
sequence contains a dash. Capitalized amino acid residues in the 
alignment at a given position belong to the amino acid coalition 
that establish the consensus residue. 

30 
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Figure 4 : Design of consensus phytase-10 amino acid 
sequence . Adding the phytase sequence of Thermomyces lanuginosa 
(Berka et al . , 1998) and the consensus sequence of the phytase s 
from five Basidiomycetes to the alignment of Figure 1 , an 
5 improved consensus sequence was calculated by the program 
PRETTY. Additionally, the amino acid sequence of A, niger T213 
was omitted, therefore, using a vote weight of 0.5 for the 
remaining A. niger phytase sequences. For further information 
see Example 2, 

10 

Figure 5 : DNA and amino acid sequence of consensus 
phytase-10. The amino acid sequence is written above the 
corresponding DNA sequence using the one-letter code. The 
sequence of the oligonucleotides which were used to assemble 

15 the gene are in bold letters . The label of oligonucleotides and 
the amino acids, which were changed compared to those for 
consensus phytase-1 , are underlined and their corresponding 
triplets are highlighted in small cases. The fcplO gene was 
assembled from the following oligonucleotides : CP-1 , CP-2, CP- 

20 3.10, CP-4.10, CP-5.10, CP-6, CP-7.10, CP-8.10, CP-9.10, CP- 
10.10, CP-11.10, CP-12.10, CP-13.10, CP-14.10, CP-15,10, CP- 
16.10, CP-17.10, CP18.10, CP-19,10, CP-20.10, CP-21.10, CP- 
22.10. The newly synthesized oligonucleotides are additionally 
marked by number 10 . The phytase contains the following 32 

25 exchanges: Y54F, E58A, D69K, D70G, A94K, N134Q, I158V, S187A, 
Q188N, D197N, S204A, T214L, D220E, L234V, A238P, D246H, T251N, 
Y259N, E267D, E277Q, A283D, R291I, A320V, R329H, S364T, I366V, 
A379K, S396A, G404A, Q415E, A437G, A463E- The mutations 
accentuated in bold letters revealed a stabilizing effect on 

30 consensus phytase-1 as tested as single mutation in consensus 
phytase-1 . 
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Figure 6 ; Alignment for the design of consensus 
phytase-11- In contrast to the design of consensus phytase-10, 
for the design of the amino acid sequence of consensus 
5 phytase-11, all Basidiomycetes phytases were used as independent 
sequences using an assigned vote weight of 0.2 for each 
Basidiomycetes sequence- Additionally, the amino acid sequence 
of A. niger T213 was used in that alignment, again. 

10 Figure 7 : DNA and amino acid sequence of consensus 

phytase-l-thermo[8] -Q50T-K91A. The amino acid sequence is 
written above the corresponding DNA sequence using the one- 
letter code- The replaced amino acid residues are underlined. 
The stop codon of the gene is marked by a star (*) . 

15 

Figure 8 : DNA and amino acid sequence of consensus 
phytase-10-thermo[3] -Q50T-K91A. The amino acid sequence is 
written above the corresponding DNA sequence using the one- 
letter code- The replaced amino acid residues are underlined, 
20 The stop codon of the gene is marked by a star (*) . 

Figure 9 : DNA and amino acid sequence of A. fumigatus ATCC 
13073 phytase alpha-mutant. The amino acid sequence is written 
above the corresponding DNA sequence using the one-letter code. 
25 The replaced amino acid residues are underlined. The stop codon 
of the gene is marked by a star {*) . 

Figure 10 : DNA and amino acid sequence of consensus 
phytase-7. The amino acids are written above the corresponding 
30 DNA sequence using the one-letter code. The sequence of the 
oligonucleotides used to assemble the gene are in bold letters. 
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Oligonucleotides and amino acids that were exchanged are 
underlined and their corresponding triplets are highlighted in 
small cases. The fcp7 gene was assembled from the following 
oligonucleotides: CP-1, CP-2, CP-3, CP-4.7, CP-5.7, CP-6, CP-7, 
5 CP-8.7, CP-9, CP-10.7, CP-11.7, CP-12.7, CP-13.7, CP-14.7, CP- 
15.7, CP-16, CP-17.7, CP-18.7, CP-19.7, CP-20, CP-21, CP-22. The 
newly synthesized oligonucleotides are additionally marked by 
number 7. The phytase contains the following 24 exchanges in 
comparison to the original consensus phytase: S89D, S92G, A94K, 
10 D164S, P201S, G203A, G205S, H212P, G224A, D226T, E255T, D256E, 
V258T, P265S, Q292H, G300K, Y305H, A314T, S364G, M365I, A397S, 
S398A, G404A, and A405S. 



Figure 11: Differential scanning calorimetry (DSC) of 
15 consensus phytase-1 and consensus phytase-10. The protein 
samples were concentrated to ca. 50-60 mg/ml and extensively 
dialyzed against 10 mM sodium acetate, pH 5.0. A constant 
heating rate of iCc/min was applied up to 95''C. DSC of 
consensus phytase-10 (upper graph) yiel.ded a melting temperature 
20 of 85.4''C, which is 7.3°C higher than the melting point of 
consensus phytase-1 (78.1''C, lower graph). 

Figure 12: Differential scanning calorimetry (DSC) of 

consensus phytase-10-thermo-Q50T and consensus phytase-10- 

25 thermo-Q50T-K9lA. The protein samples were concentrated to ca. 
50-60 mg/ml and extensively dialyzed against 10 mM sodium 
acetate, pH 5.0. A constant heating rate of lO^C/min was applied 
up to gs'C. DSC of consensus phytase-10-thermo-Q50T (upper 
graph) yielded a melting temperature of 88.6*"C, while the 

30 melting point of consensus phytase-10-thermo-Q50T-K9lA was found 
at 89.3''C. 
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Figure 13 : Comparison of the temperature optimum between 
consensus phytase-1, consensus phytase-10 and consensus phytase- 
10-thermo-Q50T. For the determination of the temperature 
5 optimum, the phytase standard assay was performed at a series of 
temperatures between 37 and 86°C. The diluted supernatant of 
transformed S, cerevisiae strains was used for the 
determination. The other components of the supernatant showed no 
influence on the determination of the temperature optimum: a, 

10 consensus phytase-1 ; , consensus phytase-10 ; consensus 
phytase 10-thermo-Q50T . 

Figure 14 : pH-dependent activity profile and substrate 
specificity of consensus phytase-10 and its variants thermo-Q50T 
15 and thermo-Q50T-K9lA. The phytase activity was determined using 
the standard assay in appropriate buffers (see Example 9) at 
different pH- values . Graph a) shows the pH-dependent activity 
profile of consensus phytase-10 (□) , consensus phytase- 10- 

thermo-Q50T ( • ) , and consensus phytase-10-thermo-Q50T-K91A (a) . 

20 Graph b) shows the corresponding substrate specificity tested by 
replacement of phytate by the indicated compounds in the 
standard assay; open bars, consensus phytase-10 (grey bars, 
consensus phytase-10-thermo-Q50T; dark bars, consensus phytase- 
10-thermo-Q50T-K91A) . The numbers correspond to the following 

25 compounds: 1, phytate; 2, p-nitrophenyl phosphate; 3, phenyl 
phosphate; 4, f ructose-1, 6-bisphosphate; 5, fructose-6- 
phosphate; 6, glucose-6-phosphate; 7, ribose-5-phosphate; 8, DL- 
glycerol-3-phosphate; 9, glycerol-2-phosphate; 10, 3- 
phosphoglycerate; 11, phosphoenolpyruvate; 12, AMP; 13, ADP; 14, 

30 ATP. 
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Figure 15 : pH-dependent activity profile and substrate 
specificity of consensus phytase-l-thermo [8] -Q50T and of 
consensus phytase-l-thermo [ 8] -Q50T-K91A. The phytase activity 
5 was determined using the standard assay in appropriate buffers 
(see Example 9) at different pH-values. Graph a) shows the pH- 
dependent activity profile of the Q50T- (■). and the Q50T-K91A- 

variant ( . ) • Graph b) shows the corresponding substrate 
specificities tested by replacement of phytate by the indicated 
10 compounds in the standard assay (open bars, consensus phytase-l- 
thermo [8] -Q50T; filled bars, consensus phytase-l-thermo [8 ] -Q50T- 
K91A) . The substrates are listed in the legend of Figure 14. 

Figure 16 : Differential scanning calorimetry (DSC) of 
15 consensus phytase-l-thermo [8 ] -Q50T and consensus phytase-l- 
thermo [8] -Q50T-K91A. The protein samples were concentrated to 
ca. 50-60 mg/ml and extensively dialyzed against 10 mM sodium 
acetate, pH 5.0. A constant heating rate of 10°C/min was applied 
up to 95°C. DSC of consensus phytase-l-thermo [ 8] -Q50T (upper 
20 graph) showed a melting temperature of 84-7°C, while the melting 
point of consensus phytase-l-thermo [8 ] -Q50T-K91A was found at 
85,7°C- 

Figure 17 : Comparison of the temperature optimum between 
25 consensus phytase-1, consensus phytase-l-thermo [ 3] and consensus 
phytase-l-thermo [8] . For the determination of the temperature 
optimum, the phytase standard assay was performed at a series of 
temperatures between 37 and .86°C. Purified protein from the 
supernatant of transformed S. cerevisiae strains was used for 
30 the determination. O, consensus phytase-1; consensus 

phytase-l-thermo [3] ; ^, consensus phytase l-thermo[8]. 
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Figure 18 : Comparison of the pH-dependent activity profile 
and substrate specificity of consensus phytase-1, consensus 
phytase-7, and of the phytase from A. niger NRRL 3135. The 
5 phytase activity was determined using the standard assay in 
appropriate buffers (see Example 9) at different pH-values, 
Graph a) shows the pH-dependent activity profile of consensus 
phytase-1 (■), the phytase from A. niger NRRL 3135 (O) , and of 

consensus phytase-7 (^). Graph b) shows the corresponding 
10 substrate specificity tested by replacement of phytate by the 
indicated compounds in the standard assay (black bars. A, niger 
NRRL 3135 phytase; grey bars, consensus phytase-1, dashed bars, 
consensus phytase-7). The substrates are listed in the legend of 
Figure 14. 

15 

Figure 19 : Differential scanning calorimetry (DSC) of the 
phytase from A. fumigatus ATCC 13073 and of its stabilized 
alpha-mutant, which contains the following amino acid exchanges 
F55Y, VIOOI, F114Y, A243L, S265P, N294D. 

20 The protein samples were concentrated to ca , 50-60 mg/ml 

and extensively dialyzed against 10 mM sodium acetate, pH 5.0. A 
constant heating rate of 10°C/min was applied up to 95°C, DSC of 
consensus A. fumigatus 13073 phytase (upper graph) revealed a 
melting temperature of 62.5*'C, while the melting point of the 

25 alpha-mutant was found at 67.0**C. 

Figure 20 ; Comparison of the temperature optimum of A. 
fumigatus 13073 wild-type, its A. fumigatus alpha-mutant, and a 
further stabilized alpha-mutant (E59A-S154N-R329H-S364T-G404A) . 
30 For the determination of the temperature optimum, the phytase 
standard assay was performed at a series of temperatures between 
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37 and 75**C. The diluted supernatant of transformed S. 
cerevisiae strains was used for the determination. The other 
components of the supernatant showed no influence on the 
determination of the temperature optimum. O, A, fumigatus ATCC 

5 13073 phytase; ^, A. fumigatus ATCC 13073 alpha-mutant; A. 
fumigatus ATCC 13073 alpha-mutant- (E59A-S154N-R329H-S364T- 
G404A)-Q27T; A. fumigatus ATCC 13073 alpha-mutant- (E59A- 

S154N-R329H-S364T-G404A) -Q27T-K68A. Q27T and K68A corresponds to 
consensus phytase-1 Q50T and K91A, respectively. 

10 

Figure 21 : T^ino acid sequence of consensus phytase 12 
(consphyl2) which contains a number of active site residues 
transferred from the "basidio" consensus sequence to consensus 
phytase-10-thermo-Q50T-K91A. 

15 

Example 1 

Consensus phytase-l 

The amino acid sequence of consensus phytase-1 (fungal 
consensus phytase, fcp) was designed and calculated as described 

20 in Examples 1-2 of EP 0897985. Table 1 below shows the origin 
and vote weight of the phytase amino acid sequences used for the 
design of consensus phytase-1. The consensus phytase-1 sequence 
was furthermore converted into a DNA sequence as described in 
Example 3 of EP 0897985, and the consensus phytase-1 gene was 

25 constructed and cloned as described in Example 4 of EP 0897985. 
EP 0897985 is hereby incorporated by reference. 

Table 1 

Origin and vote weight of the phytase amino acid sequences 
30 - phyA from Aspergillus terreus 9A-1, aa 27, vote weight 0.5 
(Mitchell et al., 1997) 
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- phyA from Aspergillus terreus cbsll6.46, aa 27, vote weight 
0.5 (van Loon et al., 1998) 

- phyA from Aspergillus niger var. awamori, aa 21, vote weight 
0.33 (Piddington et al., 1993) 

5 - phyA from Aspergillus niger T213, aa 27, vote weight 0.33 

- phyA from Aspergillus niger strain NRRL3135, aa 27, vote 
weight 0 - 33 (van Hartingsveldt et al . , 1993) 

- phyA from Aspergillus fumigatus ATCC 13073, aa 26, vote 
weight 0.2 (Pasamontes et al., 1997) 

10 - phyA from Aspergillus fumigatus ATCC 32722, aa 26, vote 
weight 0.2 (van Loon et al., 1998) 

- phyA from Aspergillus fumigatus ATCC 58128, aa 26, vote weight 
0.2 (van Loon et al., 1998) 

- phyA from Aspergillus fumigatus ATCC 26906, aa 26, vote 
15 weight 0.2 (van Loon et al., 1998) 

- phyA from Aspergillus fumigatus ATCC 32239, aa 30, vote weight 
0.2 (van Loon et al . , 1998) 

- phyA from Emericella nidulans , aa 25, vote weight 1.0 (Roche 
Nr. R1288, Pasamontes et al., 1997a) 

20 - phyA from Talaromyces thermophilus ATCC 20186, aa 24, vote 
weight 1.0 (Pasamontes et al., 1997a) 

- phyA from Myceliophthora thermophila, aa 19, vote weight 1.0 
(Mitchell et al . , 1997) 

25 Example 2 

Design of an improved consensus phytase (consensus 
phytase-10) amino acid sequence 

The alignments used for the design of consensus phytase-10 
30 were calculated using the program PILEUP from the Sequence 
Analysis Package Release 9.0 (Devereux et al., 1984) with the 
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standard parameter (gap creation penalty 12, gap extension 
penalty 4). The location of the gaps was refined using a text 
editor - 

The following sequences were used for the alignment of the 
5 Basiodiomycetes phytases starting with the amino acid (aa) 
mentioned in Table 2: 

Table 2 

Origin and vote weight of five Basidiomycetes phytases used for 
10 the calculation of the corresponding amino acid consensus 
sequence (basidio) 

- phyAl from Paxillus involutus NN005693, aa 21, vote weight 0.5 
(WO 98/28409) 

15 - phyA2 from Paxillus involutus NN005693, aa 21, vote weight 0.5 
(WO 98/28409) 

- phyA from Trametes pubescens NN9343, aa 24, vote weight 1.0 
(WO 98/28409) 

- phyA from Agrocybe pediades NN009289, aa 19, vote weight 1.0 
20 (WO 98/28409) 

- phyA from Peniophora lycii NN006113, aa 21, vote weight 1.0 
(WO 98/28409) 

The alignment is shown in Figure 3. 

25 

In Table 3 the genes, which were used for the performance 
of the final alignment, are arranged. The first amino acid (aa) 
of the sequence which is used in the^ alignment is mentioned 
behind the organism designation. 

30 

Table 3 
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Origin and vote weight of the phytase sequences used for the 
design of consensus phytase 10 

- phyA from Aspergillus terreus 9A-1, aa 27, vote weight 0.5 
5 (Mitchell et al., 1997) 

- phyA from Aspergillus terreus cbsll6,46, aa 27, vote weight 
0.5 (van Loon et al., 1998) 

- phyA from Aspergillus niger var. awamori, aa 27, vote weight 
0.5 (Piddington et al., 1993) 

10 - phyA from Aspergillus niger strain NRRL3135, aa 27, vote 
weight 0.5 (van Hartingsveldt et al., 1993) 

- phyA from Aspergillus fumigatus ATCC 13073, aa 26, vote 
weight 0.2 (Pasamontes et al., 1997) 

- phyA from Aspergillus fumigatus ATCC 32722, aa 26, vote 
15 weight 0.2 (van Loon et al., 1998) 

- phyA from Aspergillus fumigatus ATCC 58128, aa 26, vote weight 
0.2 (van Loon et al., 1998) 

- phyA from Aspergillus fumigatus ATCC 26906, aa 26, vote weight 
0.2 (van Loon et al., 1998) 

20 - phyA from Aspergillus fumigatus ATCC 32239, aa 30, vote weight 
0.2 (van Loon et al., 1998) 

- phyA from Emericella nidulans , aa 25, vote weight 1.0 (Roche 
Nr. R1288, Pasamontes et al., 1997a) 

- phyA from Talaromyces thermophilus ATCC 20186, aa 24, vote 
25 weight 1.0 (Pasamontes et al., 1997a) 

- phyA from Myceliophthora thermophila, aa 19, vote weight 1.0 
(Mitchell et al., 1997) 

- phyA from Thermomyces lanuginosa, aa 36, vote weight 1.0 
(Berka et al., 1998) 

30 - Consensus sequence of five Basidiomycetes phytases, vote 
weight 1.0 (Basidio, Figure 3) 
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The corresponding alignment is shown in Figure 4 . 

Calculation of the amino acid sequence of consensus-lO 
5 To improve the alignment, we added the original consensus 

sequence of five phytases from four different Basidiomycetes, 
called Basidio, still containing the undefined sequence 
positions (see Figure 3) , nearly all phytase sequences used for 
calculation of the original consensus phytase and one new 

10 phytase sequence from the Ascomycete Thermomyces lanuginosa to a 
larger alignment . Using the consensus sequence of the 
basidiomycetal phytase sequences, does not pay regard to the 
diversity among the five amino acid sequences, but pays regard 
to the common and different amino acid residues between the 

15 phytases from the Ascomycetes and the Basidiomycetes. 

We set plurality on 2.0 and threshold on 3. The used vote 
weight are listed in Table 3. The alignment and the 
corresponding consensus sequence is presented in Figure 4 . The 
new consensus phytase sequence has 32 different amino acids in 

20 comparison to the original consensus phytase. Positions for 
which the program PRETTY was not able to calculate a consensus 
amino acid residue were filled according to rules mentioned in 
Example 1. None of the residues suggested by the program was 
replaced - 

25 Furthermore, we included all Basidiomycetes phytases as 

single amino acid sequences but assigning a vote weight of 0.2 
in the alignment. The corresponding alignment is shown in Figure 
6. The calculated consensus . amino acid sequence (consensus 
phytase-11) has the following differences to the sequence of 

30 consensus phytase-10. Letter X means that the program was not 
able to calculate a consensus amino acid; the amino acid in 
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parenthesis corresponds to the amino acid finally included into 
the consensus phytase-10. 

D35X, X(K)69K, X(E)100E, AIOIR, Q134N, X(K)153N, X(H)190H, 
X(A)204S, X(E)220D, E222T, V227A, X(R)271R, H287A, X(D)288D, 
5 X(K)379K, X(I)389I, E390X, X(E)415E, X{A)416A, X(R)446L, E463A, 
whereas the numbering is as in Fig. 5. 

We also checked single amino acid replacements suggested 
by the improved consensus sequences 10 and 11 on their influence 
on the stability of the original consensus phytase. The approach 
10 "is described in example 3. 

Conversion of consensus phytase-lO amin o acid sequence to 
a DNA sequence 

The first 26 amino acid residues of A. terreus cbsll6.46 
15 phytase were used as signal peptide and, therefore, fused to the 
N-terminus of consensus phytase-10. The used procedure is 
further described in Example 1 . 

The resulting sequence of the fcplO gene is shown in 

Figure 5. 

20 

Construction and cloning of t h e consensus phytase-10 gene 
(fcplO) 

The calculated DNA sequence of fcplO was divided into 
oligonucleotides of 85 bp, alternately using the sequence of the 
25 sense and the anti-sense strand. Every oligonucleotide overlaps 
20 bp with its previous and its following oligonucleotide of the 
opposite strand. The location of all primers, purchased by 
Microsynth, Balgach (Switzerland) and obtained in a PAGE- 
purified form, is indicated in Figure 5. 

30 

PCR-Reactions 
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In three PGR reactions, the synthesized oligonucleotides 
were composed to the entire gene. For the PGR, the High Fidelity 
Kit from Boehringer Mannheim (Boehringer Mannheim, Mannheim, 
Germany) and the thermo cycler The ProtokolTM from AMS 
5 Biotechnology (Europe) Ltd. (Lugano, Switzerland) was used. The 
following oligonucleotides were used in a concentration of 0,2 
pMol/ml . 

Mix 1.10: CP-1, CP-2, CP-3.10, CP-4.10, GP-5.10, CP-6, 
CP-7,10, CP-8.10, CP-9,10, CP-10.10 
10 Mix 2.10: GP-9.10, CP-11.10, CP-12.10, GP-13.10, CP- 

14.10, CP-15.10, CP-16.10, CP-17,10, CP18.10, 
CP-19-10, CP-20.10, CP-21.10, GP-22.10 
The newly synthesized oligonucleotides are marked by 
number 10. The phytase contains the following 32 exchanges, 
15 which are underlined in Figure 5, in comparison to the original 
consensus phytase: Y54F, E58A, D69K, D70G, A94K, N134Q, I158V, 
S187A, Q188N, D197N, S204A, T214L, D220E, L234V, A238P, D246H, 
T251N, Y259N, E267D, E277Q, A283D, R291I, A320V, R329H, S364T, 
I366V, A379K, S396A, G404A, Q415E, A437G, A463E. 
20 Four short PGR primer were used for the assembling of the 

oligonucleotides : 

GP-a : Eco RI 

5' -TATATGAArrCATGGGGGTGTTGGTG-3' 

25 

GP-b: 

5' -TGAAAAGTTGATTGAAGGTTTG-3' 

GP-c. 10: 

30 5' -TGTTGGAAAGCAGTAGAGAAAC-3' 
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CP-e: Eco RI 

5' -TATATGAArrCTTAAGCGA7U\C-3' 



PGR reaction a: 10 pi Mix 1.10 (2.0 pmol of each 
5 oligonucleotide) 

2 pi nucleotides (10 mM each nucleotide) 

2 pi primer CP-a (10 pmol/ml) 

2 pi primer CP-c.lO (10 pmol/ml) 

10,0 pi PGR buffer 
0 0-75 pi polymerase mixture 

73.25 pi H2O 

PGR reaction b: 10 pi Mix 2.10 (2.0 pmol of each 

oligonucleotide) 
5 2 pi nucleotides (10 mM each nucleotide) 

2 pi primer GP-b (10 pmol/ml) 
2 pi primer CP-e (10 pmol/ml) 
10, 0 pi PGR buffer 

0.75 pi polymerase mixture (2.6 U) 
10 73-25 pi H2O 



Reaction conditions for PGR reaction a and b: 
step 1 2 min - 45°G 

step 2 30 sec - 72°C 

25 step 3 30 sec - 94 °G 

step 4 30 sec - 52 °G 

step 5 1 min - 72**G 



Step 3 to 5 were repeated 40-times. 

30 
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The PGR products (670 and 905 bp) were purified by an 
agarose gel electrophoresis (0.9% agarose) and a following gel 
extraction (QIAEX II Gel Extraction Kit, Qiagen, Hilden, 
Germany) . The purified DNA fragments were used for the PGR 
5 reaction c. 

PGR reaction c: 6 yl PGR product of reaction a w50 ng) 

6 yl PGR product of reaction b «50 ng) 
2 pi primer CP-a (10 pmol/ml) 
10 2 \j1 primer CP-e (10 pmol/ml) 

10,0 yl PGR buffer 

0-75 pi polymerase mixture (2.6 U) 
73.25 pi H2O 

15 Reaction conditions for PGR reaction c: 
step 1 2 min - 94 ^'G 

step 2 30 sec - 94''C 

step 3 30 sec - 55°C 

step 4 1 min - 72 °G 

20 

Step 2 to 4 were repeated 31-times. 

The resulting PGR product (1.4 kb) was purified as 
mentioned above, digested with Eco RI, and ligated in an Eco RI- 

25 digested and dephosphorylated pBsk (-) -vector (Stratagene, La 
Jolla, GA, USA) . 1 pi of the ligation mixture was used to 
transform E. coli XL-1 competent cells (Stratagene, La Jolla, 
CA, USA) . All standard procedures were carried out as described 
by Sambrook et al. (1987). The DNA sequence of the constructed 

30 gene (fcplO) was checked by sequencing as known in the art. 
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Example 3 

Increasing the thermostability of consensus phytase-1 by 
introduction of single mutations suggested by the amino 
acid seguence of consensus phytase-10 and consensus 
5 phytase-ll 

In order to increase the thermostability of homologous 
genes, it is also possible to test the stability effect of each 
differing amino acid residue between the protein of interest and 

10 the calculated consensus sequence and to combine all stabilizing 
mutations into the protein of interest. We used the consensus 
phytase as protein of interest and tested the effect on the 
protein stability of 34 amino acid residues, differing to 
consensus phytase 10 and/or 11 as single mutations. 

15 To construct muteins for expression in A. niger, S. 

cerevisiae, or H. polymorpha, the corresponding expression 
plasmid containing the consensus phytase gene was used as 
template for site-directed mutagenesis (see Example 6-8) . 
Mutations were introduced using the "quick exchangeTM site- 

20 directed mutagenesis kit" from Stratagene ( La Jolla, CA, USA) 
following the manufacturer's protocol and using the 
corresponding primers. All mutations made and their 
corresponding primers are summarized in Table 4. Plasmids 
harboring the desired mutation were identified by DNA sequence 

25 analysis as known in the art. 

Table 4 

Primers used for site-directed mutagenesis of consensus p hytase 
(Exchanged bases are highlighted in bold. The introduction of a 
30 restriction site is marked above the sequence. When a 
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restriction site is written in parenthesis, the mentioned site 
was destroyed by introduction of the mutation,) 



mutation 



Primer set 



10 



15 



Q50T 



Y54F 



E58A 



D69K 



20 D70G 



K91A 



25 



A94K 



30 AIOIR 



N134Q 



35 



K153N 



40 



I158V 



Kpn I 

CACTTGTGGGGTACCTACTCTCCATACTTCTC- 3 ' 
GAGAAGTATGGAGAGTAGFGrACCCCACAAGTG-3 ' 

GGTCAATACTCTCCATTCTTCTCTTTGGAAG - 3 ' 
. CTTCCAAAGAGAAGAATGGAGAGTATTGACC - 3 ' 

■CATACTTCTCTTTGGCAGACGAATCTGC-3 ' 
•GCAGATTCGTCTGCCAAAGAGAAGTATG-3 ' 

Aat II 

.CTCCAGACG7X:CCAAAGGACTGTAGAGTTAC-3 ' 
-GTAACTCTACAGTCCTTTGGGACGTCTGGAG-3 ' 

Aat II 

-CTCCAGACGTCCCAGACGGCTGTAGAGTTAC- 3 ' 
• GTAACTCTACAGCCGTCTGGGA CGTCTGGAG- 3 ' 

-GATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTG- 3 ' 

- CAGAGTAAGCCTTAGACGCAGAAGAAGTTGGGTATC - 3 ' 

Sea I 

-CTTCTAAGTCTAAGAAGTACTCTGCTTTG-3 ' 
-CAAAGCAGAGTACTTCTTAGACTTAGAAG-3 ' 

-GCTTACTCTGCTTTGATTGAACGGATTCAAAAGAACGCTAC- 3 ' 
-GTAGCGTTCTTTTGAATCCGTTCAATCAAAGCAGAGTAAGC- 3 ' 

- CCATTCGGTGAACAGCAAATGGTTAACTC - 3 ' 
-GAGTTAACCATTTGCTGTTCACCGAATGG- 3 ' 

Nru I 

- GATACAAGGCTC TCGCGAGAAACATTGTTC - 3 ' 

- GGAACAATGTTTC 7UGCGAGAGCCTTGTATC - 3 ' 

Bss HI 

-GATTGTTCCATTCGTGCGCGCTTCTG 
-GAACCAGAAGa?a?CACGAATGGAACAATC- 3 ' 



5808.010/220999/KaPe 



41 
Bel I 

D197N 5 ' -CTCCAGTTATTAACGr(3ArCATTCCAGAAGG-3 ' 
5 ' -CCTTCTGGAATOATCACGTTAATAACTGGAG- 3 ' 

5 Apa I 

S187A 5 ' -GGCTGACCCAGGGGCCCAACCACACCAAGC-B ' 
5 ' -GCTTGGTGTGGTTGffGCCCCTGGGTCAGCC- 3 ' 

Nco I 

10 T214L 5'- CACTTTGGACCATGGTCTTTGTACTGCTTTCG - 3 ' 
5 ' -CGAAAGCAGTACAAAGACGATGGTCCAAAGTG-3 ' 

Avr II 

E 2 2 2 T 5 ' - GCTTTCGAAGACTCTACCC rAGGTGACGACGTTG - 3 
15 5 ' - CAACGTCGTCACCTAGGGTAGAGTCTTCGAAAGC - 3 

V2 2 7 A 5 ' - GGTGACGACGCTGAAGCTAACTTCAC - 3 ' 
5 ' -GTGAAGTTAGCTTCAGCGTCGTCACC-3 ' 

20 Sac II 

L2 3 4 V 5 ' - CTAACTTCACCGOTGTGTTCGCTCCAG - 3 ' 
5 ' -CTGGAGCGAACACCGCGGTGAAGTTAG-3 ' 

A2 3 8P 5 ' - GCTTTGTTCGCTCCACCTATTAGAGCTAGATTGG - 3 

25 5 ' - CCAATCTAGCTCTAATAGGTGGAGCGAACAAAGC - 3 

Hpa 1 

T2 5 IN 5 ' - GCCAGGTGTTAACTTGACTGACGAAG - 3 ' 
5 ' - TTCGTCAGTCAAGrrAACACCTGGC - 3 ' 

30 

Aat II 

Y2 5 9N 5 ' -GACGAAGACGTrrGTTAACTTGATGGAC- 3 ' 

5' -GTCCATCAAGTTAACGACGrcrTCGTC-3' 

35 Asp I 

E267D 5 ' -GTCCATTCGACACTGrcGCTAGAACTT C-3 ' 
5 ' -GAAGTTCTAGCGACAGreTCGAATGGAC-3 ' 

E2 7 7Q 5 ' - CTGACGCTACTCAGCTGTCTCCATTC - 3 ' 

40 5 ' - GAATGGAGACAGCTGAGTAGCGTCAG - 3 ' 

A2 8 3D 5 ' - GTCTCCATTCTGTGATTXGTXCACTCAC - 3 ' 
S' -GTGAGTGAACAAATCACAGAATGGAGAC-3 ' 

45 Ksp I 

H2 8 7A 5 ' -GCTTTGTTCACCGCGGACGAATGGAG- 3 ' 
5 ' -CTCCATTCGTCCGCGGTGAACAAAGC-3 ' 
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10 



R291I 



Q2 92A 



A3 20V 



15 R329H 



20 



25 



S364T 



1366V 



A3 79K 



30 S396A 



G404A 



35 



Q415E 



40 A437G 



A463E 



45 



Bam HI 

-CACGACGAATGGATCCAATACGACTAC- 3 ' 

- GTAGTCGTATTGGA TCCATTCGTCGTG - 3 ' 

Bsi WI 

- GACGAATGGAGAGCGTACGACTACTTG- 3 ' 

- CAAGTAGTCGTACGCTCTCCATTCGTC - 3 ' 

Hpa I 

-GGTGTTGGTTTCGTrAACGAATTGATTGC- 3 ' 
-GCAATCAATTCGTTAAOGAAACCAACACC-3 ' 

(Bgl II) 

-GCTAGATTGACTCACrCTCCAGTTCAAG- 3 ' 
-CTTGAACTGGAGAGrGAGrcAATCTAGC-3 ' 

Eco RV 

- CTCACGACAACACTATGATA TCTATTTTCTTC - 3 ' 
-GAAGAAAATAGATATCATAGTGTTGTCGTGAG- 3 ' 

Nco I 

-CGACAACTCCArGGTTTCTATTTTCTTCGC-3 ' 
-GCGAAGAAAATAGAAACCAroGAGTTGTCG-3 ' 

Kpn I 

-GTACAACGGTACCAAGCCATTGTCTAC-3 ' 
-GTAGACAATGGCTTGGTACa3TTGTAC-3 ' 

-CTGACGGTTACGCTGCTTCTTGGAC-3 ' 
-GTCCAAGAAGCAGCGTAACCGTCAG-3 ' 

-CTGTTCCATTCGCTGCTAGAGCTTAC-3 ' 

- GTAAGCTCTAGCAGCGAATGGAACAG- 3/ 

-GATGCAATGTGAAGCTGAA^AGG^^ 3 ' 
-GGTTCCTTTTCAGCTTCACATTGCAT^^^^^ ' 

Sal I 

■ CACGGTTGTGGTGTCGACT^GTTGGG - 3 ' 
- CCCAACTTGTCGACACCACi^ - 3 ' 

Mun I 

-GATCTGGTGGCAArTGGGAGGAATGTTTCG-3 ' 

- CGAAACATTCCTCC CAA TTOCCACCAGATC - 3 ' 
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and accordingly for other mutations. 

The temperature optimum of the purified phytases, 
expressed in Saccharomyces cerevisiae (Example 7) , was 
5 determined as outlined in Example 9. Table 5 shows the effect on 
the stability of consensus phytase for each mutation introduced. 

Table 5 

Stability effect of the individual amino acid replacements in 

10 consensus phytase- 1 

(+ or - means a positive^ respectively, negative effect on the 
protein stability up to 1°C, ++ and — means a positive, 
respectively, negative effect on the protein stability between 1 
and 3^*0; the number 10 or 11 corresponds to the consensus 

15 phytase sequence that suggests the amino acid replacement.) 
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Stabilizing neutral destabilizi 



mutation 


effect 


mutation 


ef f ec 
t 


mutation effect 


E58A (10) 


+ 


D69A 


± 


Y54F 


(10) 


D69K (11) 


+ 


D70G (10) 


+ 


V73I 


D197N (10) 


+ 


N134Q (10) 


± 


A94K 


(10) 


T214L (10) 


+ + 


G186H 


± 


AIOIR 


(11) 


E222T (11) 


+ + 


S187A (10) 


± 


K153N 


(11) 


E267D (10) 




T214V 


± 


I158V 


(10) - - 


R2 911* 


+ 


T251N (10) 


± 


G2 03A 




R329H (10) 


+ 


Y259N (10) 


± 


G205S 




S364T (10) 


+ + 


A283D (10) 


± 


A217V 


- 


A379K (11) 


+ 


A320V (10) 


± 


V22 7A 


(11) - - 


G404A (10) 


+ + 


K44 5T 


± 


L234V 


(10) 






A4 6 3 E no) 


X 


A238P 


(10) - - 










E277Q 


(10) 










H2 87A 


(11) 










Q292A 


(10) 










I366V 


(10) 










S396A 


(10) - - 










Q415E 


(11) 










A437G 


(10) - - 










E451R 





*: This amino acid replacement was found in another round of 



mutations . 

5 We combined eight positive mutations (E58A, D197N, E267D, 

R291I, R329H, S364T, A379K, G404A) in the consensus phytase 
using the primers and the technique mentioned above in this 
example. Furthermore, the mutations Q50T and K91A were 
introduced which mainly influence the catalytical 

10 characteristics of the phytase (see patent application EP 
97810175.6 and EP 97112688 as well as Example 9). The DNA and 
amino acid sequence of the resulting phytase gene (consensus 
phytase-thermo[8]-Q50T-K91A) is shown in Figure 7. In this way, 
the temperature optimum and the melting point of the consensus 

15 phytase was increased by 7°C (Figure 15, 16, 17). 

Using the results of Table 5, we further improved the 
thermostability of consensus phytase 10 by the following back 
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mutations K94A, V158I, and A396S that revealed a strong negative 
influence on the stability of consensus phytase. The resulting 
protein is phytase-lO-thermo [3] . Furthermore, we introduced the 
mutations Q50T and K91A which mainly influence the catalytical 
5 characteristics of consensus phytase (see patent application EP 
97810175.6 and EP 97112688 as well as Example 9 and Figure 14 
and 15) - The resulting DNA and amino acid sequence is shown in 
Figure 8. The optimized phytase showed a 4°C higher temperature 
optimum and melting point than consensus phytase 10 (Figure 12 
10 and 13) . Furthermore, the phytase has also a strongly increased 
specific activity with phytate as substrate of 250 U/mg at pH 
5. 5 (Figure 14) . 

Example 4 

15 Stabilization of the phytase of A> fumigatus ATCC 13073 by 

replacement of amino acid residues with the corresponding 
consensus phytase-1 and consensus phytase-10 residues 

At six typical positions where the A. fumigatus 13073 is 
20 the only or nearly the only phytase in the alignment of Figure 1 
that does not contain the corresponding consensus phytase amino 
acid residue, the non-consensus amino acid residue was replaced 
by the consensus one. In a first round, the following amino 
acids were substituted in A. fumigatus 13073 phytase, containing 
25 the Q27(24)T substitution and the signal sequence of A. terreus 
cbs.116.4 6 phytase (see European Patent Application No. 
97810175.6 and Figure 9): F55(28)Y, V100(73)I, F114(87)Y, 
A243(220)L, S265(242)P, N294(282)D. 

The numbers in parentheses confer to the numbering of 
30 Figure 1. Number 27 in the mutation Q27(24)T refers to the 
sequence numbering of Anigmature (phytase of A. niger (ficuum) 
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NRRL 3135) shown in Fig. 1 of EP 0897010. EP 0897010 is hereby 
incorporated by reference. 

In a second round, four of the seven stabilizing amino 
acid exchanges (E59A, R329H, S364T, G404A) found in the 
5 consensus phytase-10 sequence and, tested as single mutation in 
consensus phytase-1 (Table 5) , were additionally introduced into 
the A. fumigatus a-mutant. Furthermore, the amino acid 
replacement S126(154)N, shown to reduce the protease 
susceptibility of the phytase, was introduced. The number 126 in 

10 the mutation S126(154)N refers to the sequence Afummature shown 
in Fig. 1 of EP0897010. 

The mutations were introduced as described in example 3 
(see Table 6) and expressed as described in example 6 to 8. The 
resulting A, fumigatus 13073 phytase variants were called a - 

15 mutant (i.e. the A. fumigatus ATCC 13073 phytase with the 
substitutions Q24T, F28Y, V73I, F87Y, A220L, S242P, N282D) and 
alpha-mutant or optimized alpha-mutant (i.e. the A. fumigatus 
alpha-mutant having the additional substitutions E59A-S154N- 
R329H-S364T-G404A) . K68A is an additional preferred mutation. 

20 The temperature optimum (60°C, Figure 20) and the melting 

point (67.0**C, Figure 19) of the A. fumigatus 13073 phytase 
alpha-mutant was increased by S'^C in comparison to the values of 
the wild-type (temperature optimum: 55°C, Tm: 60 °C) , The five 
additional amino acid replacements further increased the 

25 temperature optimum by 3^C (Figure 20) , 

Table 6 

Mutagenesis primers for stabilization of A. fumigatus phytase 
ATCC 13073 



30 
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Mutation Primer 
F55Y 



E58A 



10 



30 



35 



5 ' -CACGTACTCGCCATACTTTTCGCTCGAG-3 ' 
5 ' -CTCGAGCGAAAAGTATGGC(3AGTACGTG-3 ' 

(Xho I) 

5 ' -CCATACTTTTCGCTCGCGGACGAGCTGTCCGTG-S 
5 ' -CACGGACAGCTCGTCOffCGAGCGAAAAGTAGG- 3 ' 



VI 0 0 1 5 ' - GTATAAGAAGCTTATTACGGCGATCCAGGCC - 3 ' 

5 ' -GGCCTGGATCGCCGTAATAAGCTTCTTATAC-3 

F114Y 5 ' -CTTCAAGGGCAAGTACGCCTTTTTGAAGACG-3 ' 

5 ' -CGTCTTCAAAAAGGCGTACTTGCCCTTGAAG- 3 ' 

15 A243L 5' -CATCCGAGCTCGCCTCGAGAAGCATCTTC-3' 

5 ' -GAAGATGCTTCTCGAGGCGAGCTCGGATG-3 ' 

S265P 5'-CTAATGGA TGTGTCCOTTTGATACGGTAG-3 ' 

5 ' -CTACCGTATCAAACGOACACATGTCCATTAG- 3 ' 

20 

N2 94D 5 ' -GTGGAAGAAGTACGACTACCTTCAGTC- 3 ' 

5 ' -GACTGAAGGTAGTCGTACTTCTTCCAC-3 ' 

(Mlu I) 

25 R329H 5 ' -GCCCGGTTGACGCa7TCGCCAGTGCAGG-3 ' 

5 ' -CCTGCACTGGCGAArGCG7X:y^CCGGGC-3 ' 



Nco I 

S3 64 T 5 ' - CACACGACAACACCATGGTTTCCATCTTC - 3 ' 

5 ' -GAAGATGGAAACCMSfiTGTTGTCGTGTG- 3 ' 

(Bss HI) 

G4 0 4 A 5 ' - GTGGTGCCTTTCGCCGCGCGAGCCTACTTC - 3 ' 

5' -GAAGTAGGCTCGCGCGGrCGAAAGGCACCAC-3 ' 

Example 5 

Introduction of the ac tive site amino acid residues of th^ 
A. niqer NRRL 3135 phytase into the consensus phytase-l 

We used the crystal structure of the Aspergillus 
40 niger NRRL 3135 phytase to define all active site amino acid 
residues (see Reference Example and EP 97810175.6). Using the 
alignment of Figure 1, we replaced the following active site 
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residues and additionally the not identical adjacent ones of the 
consensus phytase by that of the A, niger phytase: 

S89D, S92G, A94K, D164S, P201S, G203A, G205S, H212P, 
G224A, D22 6T, E255T, D256E, V258T, P2 65S, Q292H, G300K, y305H, 
5 A314T, S364G, M365I, A397S, S398A, G404A, and A405S 

The new protein sequence consensus phytase -7 was 
backtranslated into a DNA sequence (Figure 10) as described in 
Example 1 . The corresponding gene ( f cp7 ) was generated as 
described in Example 1 using the following oligonucleotide 
10 mixes: 

Mix 1.7: CP-1, CP-2, CP-3, CP-4.7, CP-5.7, CP-6^ CP-7, CP- 
8.7, CP-9, CP-10.7 



Mix 2.7: CP-9, CP-10.7, CP-11.7, CP-12.7, CP-13.7, CP-14-7, 
15 CP-15.7, CP-16, CP-17.7, CP-18.7, CP-19.7, CP-20, 

CP-21, CP-22 . 



The DNA sequences of the oligonucleotides are indicated in 
Figure 3. The newly synthesized oligonucleotides are 

20 additionally marked by number 7. After assembling of the 
oligonucleotides using the same PGR primers as mentioned in 
Example 1 , the gene was cloned into an expression vector as 
described in Examples 6-8. 

The pH-profile determined after expression in H. 

25 polymorpha and purification was shifted into the acidic range 
of the pH-spectrum showing an optimum at pH 4.5-5.0 (see Figure 
18). The enzyme had a broad pH-optimum reaching at least 60% of 
its maximum activity from pH 2.5 to pH 6.0. Up to pH 5.0, the 
profile resembled the profile of the A. niger NRRL 3135 phytase. 

30 However, below pH 5.0 it lacked the typical low at pH 4 . 0 of the 
profile of A, niger phytase. 
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Example 6 

Expression of the consensus phytase genes in Hansenula 
polymorpha 

5 The phytase expression vectors, used to transform H. 

polymorpha RBll (Gellissen et al., 1994), was constructed by 
inserting the Eco RI fragment of pBsk-fcp or variants thereof 
into the multiple cloning site of the H. polymorpha expression 
vector PFPMT121, which is based on an ura3 selection marker from 
10 S. cerevisiae, a formate dehydrogenase (FMD) promoter element 
and a methanol oxidase (MO) termiraator element from H. 
polymorpha. The 5' end of the fcp gene is fused to the FMD 
promoter, the 3' end to the MOX terminator (Gellissen et al . , 
1996; EP 0299 108 B) . The resulting expression vector are 
15 designated pFPMTfcp, pFPMTfcplO, pFPMTfcp7. 

The constructed plasmids were propagated in E. coli. 
Plasmid DNA was purified using standard state of the art 
procedures. The expression plasmids were transformed into the H. 
polymorpha strain RPll deficient in orotidine-5' -phosphate 
20 decarboxylase (ura3) using the procedure for preparation of 
competent cells and for transformation of yeast as described in 
Gelissen et al. (1996). Each transformation mixture was plated 
on YNB (0.14% w/v Difco YNB and 0.5% ammonium sulfate) 
containing 2% glucose and 1.8% agar and incubated at 37 °C. 
25 After 4 to 5 days individual transformant colonies were picked 
and grown in the liquid medium described above for 2 days at 37 
"C. subsequently, an aliquot of this culture was used to 
inoculate fresh vials with YNB-medium containing 2% glucose. 
After seven further passages in selective medium, the expression 
30 vector integrates into the yeast genome in multimeric form, 
subsequently, mitotically stable transformants were obtained by 
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two additional cultivation steps in 3 ml non-selective liquid 
medium (YPD, 2% glucose, 10 g yeast extract, and 20 g peptone) . 
In order to obtain genetically homogeneous recombinant strains 
an aliquot from the last stabilization culture was plated on a 
5 selective plate. Single colonies were isolated for analysis of 
phytase expression in YNB containing 2% glycerol instead of 
glucose to derepress the fmd promoter. Purification of the 
consensus phytases was done as described in Example 7. 

10 Example 7 

Expression of the consensus phytase genes in Saccharomyces 
cerevisiae and purification of the phytases from culture 
supernatant 

The consensus phytase genes were isolated from the 

15 corresponding Bluescript-plasmid (pBsk-fcp, pBSK-fcplO, pBsk- 
fcp7) and ligated into the Eco RI sites of the expression 
cassette of the Saccharomyces cerevisiae expression vector pYES2 
(Invitrogen, San Diego, CA, USA) or subcloned between the 
shortened GAPFL (glyceraldhyde-3-phosphate dehydrogenase) 

20 promoter and the pho5 terminator as described by Janes et al. 
(1990) . The correct orientation of the gene was checked by PGR. 
Transformation of S, cerevisiae strains. e- g. INVScl 
(Invitrogen, San Diego, CA, USA) was done according to Hinnen et 
al. (1978). Single colonies harboring the phytase gene under the 

25 control of the GAPFL promoter were picked and cultivated in 5 ml 
selection medium (SD-uracil, Sherman et al., 1986) at 30°C under 
vigorous shaking (250 rpm) for one day. The preculture was then 
added to 500 ml YPD medium (Sherman et al., 1986) and grown 
under the same conditions. Induction of the gall promoter was 

30 done according to manufacturer's instruction. After four days of 
incubation cell broth was centrifuged (7000 rpm, GS3 rotor, 15 
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min, S'^C) to remove the cells and the supernatant was 
concentrated by way of ultrafiltration in Amicon 8400 cells 
(PM30 membranes) and ultrafree-15 centrifugal filter devices 
(Biomax-30K, Millipore, Bedford, MA, USA) . The concentrate (10 
5 ml) was desalted on a 40 ml Sephadex G25 Superfine column 
(Pharmacia Biotech, Freiburg, Germany), with 10 mM sodium 
acetate, pH 5.0, serving as elution buffer- The desalted sample 
was brought to 2 M (NH4)2S04 and directly loaded onto a 1 ml 
Butyl Sepharose 4 Fast Flow hydrophobic interaction 

10 chromatography column (Pharmacia Biotech, Feiburg, Germany) 
which was eluted with a linear gradient from 2 M to 0 M (NH4)2S04 
in 10 mM sodium acetate, pH 5.0, Phytase was eluted in the 
break-through, concentrated and loaded on a 120 ml Sephacryl S- 
300 gel permeation chromatography column (Pharmacia Biotech, 

15 Freiburg, Germany) . Consensus phytase and consensus phytase -7 
eluted as a homogeneous symmetrical peak and was shown by SDS- 
PAGE to be approx. 95% pure. 

Example 8 

20 Expression of the consensus phytase genes in Aspergillus 

niger 

The Bluescript-plasmids pBsk-fcp, pBSK-fcplO, and pBsk- 
fcp7 were used as template for the introduction of a Bsp Hl-site 
upstream of the start codon of the genes and an Eco RV-site 
25 downstream of the stop codon. The ExpandTM High Fidelity PGR Kit 
(Boehringer Mannheim, Mannheim, Germany) was used with the 
following primers: 

Primer Asp-1: 
30 Bsp HI 

5 ' -TATArCArGAGCGTGTTCGTCGTGCTACTGTTC-3 ' 



5808 . 010/220999/KaPe 



52 

Primer Asp-2 used for cloning of fcp and f cp7 : 

Eco RV 

5 3 ' -ACCCGACTTACAAAGCGAATTCTArAGATATAT-S ' 

Primer Asp-3 used for cloning of fcplO: 

Eco RV 

10 3' -ACCCTTCTTACAAAGCGAATTCrArAGATATAT-5' 

The reaction was performed as described by the supplier. 
The PCR-amplif ied fcp-genes had a new Bsp HI site at the start 
codon, introduced by primer Asp-1, which resulted in a 

15 replacement of the second amino acid residue glycine by serine. 
Subsequently, the DNA-fragment was digested with Bsp HI and Eco 
RV and ligated into the Nco I site downstream of the 
glucoamylase promoter of Aspergillus niger (glaA) and the Eco RV 
site upstream of the Aspergillus nidulans tryptophan C 

20 terminator (trpC) (Mullaney et al., 1985). After this cloning 
step, the genes were sequenced to detect possible failures 
introduced by PGR, The resulting expression plasmids which 
basically corresponds to the pGLAC vector as described in 
Example 9 of EP 684 313, contained the orotidine-5 ' -phosphate 

25 decarboxylase gene (pyr4) of Neurospora crassa as a selection 
marker. Transformation of Aspergillus niger and expression of 
the consensus phytase genes was done as described in EP 684 313, 
The consensus phytases were purified as described in Example 7, 



30 Example 9 
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Determination of phytase activity and of temperature 
optimum 

Phytase activity was determined basically as described by 
Mitchell et al (1997) . The activity was measured in an assay 
5 mixture containing 0.5% phytic acid («5 mM) in 200 mM sodium 
acetate, pH 5.0. After 15 min of incubation at 31°Cr the 
reaction was stopped by addition of an equal volume of 15% 
trichloroacetic acid. The liberated phosphate was quantified by 
mixing 100 pi of the assay mixture with 900 yl H20 and 1 ml of 

10 0.6 M H2SO4 , 2% ascorbic acid and 0.5% ammonium molybdate. 
Standard solutions of potassium phosphate were used as 
reference. One unit of enzyme activity was defined as the amount 
of enzyme that releases 1 )xmol phosphate per minute at 37 ^'C. The 
protein concentration was determined using the enzyme extinction 

15 coefficient at 280 nm calculated according to Pace et al (1995) : 
consensus phytase, 1.101; consensus phytase 7, 1.068; consensus 
phytase 10, 1.039. 

In case of pH-cptimum curves, purified enzymes were 
diluted in 10 mM sodium acetate, pH 5.0. Incubations were 

20 started by mixing aliquots of the diluted protein with an equal 
volume of 1% phytic acid («10 mM) in a series of different 
buffers: 0.4 M glycine/HCl, pH 2.5; 0.4 M acetate/NaOH, pH 3.0, 
3.5, 4.0, 4.5, 5.0, 5.5; 0.4 M imidazole/HCl , pH 6.0, 6-5; 0.4 M 
Tris/HCl pH 7.0, 7,5, 8.0, 8-5, 9.0- Control experiments showed 

25 that pH was only slightly affected by the mixing step. 
Incubations were performed for 15 min at 37**C as described 
above . 

For determinations of the substrate specificities of the 
phytases, phytic acid in the assay mixture was replaced by 5 mM 
30 concentrations of the respective phosphate compounds. The 
activity tests were performed as described above. 
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For determination of the temperature optimum, enzyme (100 
pi) and substrate solution (100 vlI) were pre-incubated for 5 min 
at the given temperature. The reaction was started by addition 
of the substrate solution to the enzyme. After 15 min 

5 incubation, the reaction was stopped with trichloroacetic acid 
and the amount of phosphate released was determined. 

The pH-optimum of the original consensus phytase was 
around pH 6.0-6.5 (70 U/mg) . By introduction of the Q50T 
mutation, the pH-optimum shifted to pH 6.0 (130 U/mg). After 

10 introduction of K91A, the pH optimum shifted one pH-unit into 
the acidic pH-range showing a higher specific activity between 
pH 2.5 and pH 6.0. That was shown for the stabilized mutants and 
for consensus phytase-10, too (Figure 14 and 15) . 

consensus phytase-7, which was constructed to transfer the 
15 catalytic characteristics of the A. niger phytase NRRL 3135 into 
the consensus phytase, had a pH-profile which is shifted into 
the acidic range of the pH-spectrum showing an optimum between 
pH 4.5 and 5.0 (see Figure 19). The enzyme had a broad pH- 
optimum reaching at least 60% of its increased maximum activity 
20 from pH 2.5 to pH 6.0. The substrate spectrum, too, resemble 
more to that of the A. niger NRRL 3135 phytase than to the 

consensus phytase-1. 

The temperature optimum of consensus phytase-1 (71°C) was 
16-26''C higher than the temperature optimum of the wild-type 
25 phytases (45-55''C, Table 7) which were used to calculate the 
consensus sequence. The improved consensus phytase-10 showed a 
further increase of its temperature optimum to 80=0 (Figure 11) . 

The temperature optimum of the consensus phytase-1- 
thermo[81 was found in the same range (78°C) using the 
30 supernatant of an overproducing S. cerevisiae strain. The 
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highest temperature optimum reached of 82°C was determined for 
consensus phytase-10-thermo-Q50T-K91A. 

Table 7 

5 Temperature optimum and Tm-value of consensus phytase and 

of the phytases from A. fumiqatusr A. niger, E. nidulans, 
and M. thermophila. 

The determination of the temperature optimum was performed 
10 as described in Example 9. The Tm-values were determined by 
differential scanning calorimetry as described in Example 10. 



phytase 


temperature 
optimum [^C] 




Consensus phytase-10- 
c ne nuo o u x is.-? j./^ 


82 


89.3 


Consensus phytase - 10- 
thermo-Q50T 


82 


88.6 


Consensus phytase- 10 


80 


85 .4 


Consensus phytase- 1- 
thermo [8] -Q5 0T 


78 


84.7 


Consensus phytase-1- 
thermo[8] -Q50T-K91A 


78 


85 .7 


Consensus phytase- 1 


71 


78 . 1 


A. niger NRRIi3135 


55 


63 .3 


A. fumigatus 13073 


55 


62 .5 


A, fumigatus 13073 
a -mutant 


60 


67.0 


A, fumigatus 13073 
a-mutant (optimized) 


63 




A . terreus 9A- 1 


49 


57.5 


A. terreus cbs, 116.46 


45 


58 . 5 


E. nidulans 


45 


55 . 7 


M. thermophi 1 a 


55 


n. d. 


T . thermophi 1 us 


45 


n. d. 
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Example 10 

Determination of the melting point by differential 
scanning calorimetry (DSC) 
5 In order to determine the unfolding temperature of the 

phytases, differential scanning calorimetry was applied as 
previously published by Brugger et al (1997). Solutions of 50-60 
mg/ml homogeneous phytase were used for the tests. A constant 
heating rate of 10°C/min was applied up to 90-95°C. 
10 The determined melting points reflect the results obtained 

for the temperature optimums (Table 7) . The most stable 
consensus phytase designed is consensus phytase-10-thermo-Q50T- 
K91A showing a melting temperature under the choosen condition 
of 89.3**C- This is 26 to 33, 6^*0 higher than the melting point of 
15 the wild-type phytases used. 

Example 11 

Transfer of basidiomycete phytase active site into 

consensus phytase- 10- thermo-Q50T-K91 A 
20 As described previously (Example 3) , mutations derived 

from the basidiomycete phytase active site were introduced into 
the consensus phytase 10 . The following five constructs a) to e) 
were prepared: 

a) This construct is called consensus phytase 12, and it 
25 comprises a selected number of active site residues of the 

basidio consensus sequence, its amino acid sequence (consphyl2) 
is shown in Fig. 21 (the first 26 amino acids forms the signal 
peptide, amended positions are underlined) ; 

b) a cluster of mutations (Cluster II) was transferred to 
30 the consensus 10 sequence, viz . : S80Q, Y86F, S90G, K91A, S92A, 

K93T, A94R, Y95I; 
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c) analogously, another cluster of mutations (Cluster 
III) was transferred, viz.: T129V, E133A, Q143N, M136S, V137S, 
N138Q, S139A; 

d) analogously, a further cluster of mutations (Cluster 
5 IV) was transferred, viz.: A168D, E171T, K172N, F173W; 

e) and finally, a further cluster of mutations (Cluster 
V) was transferred, viz.: Q297G, S298D, G300D, Y305T- 

These constructs were expressed as described in Examples 6 

to 8. 

10 

Example 12 

Phytase alignment using GAP 

The phytases described herein - i.e. the amino acid 
sequences as well as the corresponding DNA sequences - were 
15 aligned against each other. Also some other phytases were 
correspondingly aligned, viz. the following: 

■ the consensus phytase described in EP 0897985; 

■ the phytase derived from Aspergillus niger (ficuum) NRRL 3135 
(A. niger NRRL 3135) described in EP 0420358; 

20 ■ the phytases derived from Aspergillus fumigatus ATCC 13073 (A. 
fumigatus 13073); Aspergillus fumigatus ATCC 32239 (A. 
fumigatus 32239); Aspergillus terreus CBS 116.46 (A.terreus 
cbs) ; Aspergillus nidulans (E . nidulans) ; and Talaromyces 
thermophilus (T. thermophilus) - all described in EP 0897010; 

25 ■ the phytases derived from Myceliophthora thermophila (M. 
thermophila) ; and Aspergillus terreus 9-Al (A. terreus 9-Al) - 
both described in EP 0684313; 

■ the phytase derived from Thermomyces lanuginosus 
(T.lanuginosus) described in WO 9735017 (PCT/US97/04559) ; 

30 ■ the phytases derived from Agrocybe pediades (A. pediades) , 
Paxillus involutus 1 and 2 (P. involutus 1 and 2); and 
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Trametes pubescens (T. pubescens) - all described i WO 
98/28409; and 

■ the phytase derived from Peniophora lycii (P. lycii) described 
in WO 98/28408. 

5 For the alignments, the program GAP was used with the 

settings as described above. 

For polypeptide comparisons, the signal peptide were 
included. However, for alignment to consensus phytase 11, the 
signal peptides were excluded and only the mature protein part 
10 of the other sequences were compared to it . 

The results are shown in Table 8 below. The first number 
in each box or cell is the amino acid similarity, the second 
number is the amino acid identity. 

For DNA sequence comparisons, the signal sequence was 
15 included (the same in all phytases) . 

The results are shown in Table 9 below. 

The following embodiments are preferred: 
20 Phytases and corresponding DNA sequences related to 

consensus phytase 10 (CPIO, Fcp 10) 

A phytase which comprises an amino acid sequence which is 
at least 93.80%, preferably at least 94, 94.5, 95, 95.5, 96, 
96.5, 97, 97.5, 98, 98.5, 99, 99.5% identical to the sequence of 
25 amino acids 1-467 of consensus phytase 10 (FcplO) as shown in 
Fig. 5, 

A phytase which comprises an amino acid sequence which is 
at least 95.09%, preferably at least 95.5, 96, 96.5, 97, 97.5, 
98, 98.5, 99, 99.5% similar to the sequence of amino acids 1-467 
30 of consensus phytase 10. 
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A phytase which is encoded by a DNA sequence which is at 
least 95.88, preferably at least 96, 96,5, 97, 97.5, 98, 98,5, 
99, 99.5% identical to nucleotides 12-1412 of the DNA sequence 
of consensus phytase 10 (FcplO) as shown in Fig. 5. 
5 A DNA sequence which encodes a phytase and which (i) is at 

least 95.88%, preferably at least 96, 96,5, 97, 97,5, 98, 98.5, 
99, 99.5% identical, or (ii) hybridizes under low, preferably 
medium, medium/high, high, very high stringency conditions to 
nucleotides 12-1412 of the DNA sequence of consensus phytase 10 

10 (FcplO) as shown in Fig. 5. A preferred negative control is DNA 
encoding consensus phytase. A preferred positive control is DNA 
encoding any of CPIO, CPlO-thermo (3) -Q50T, K91A, CPl-thermo ( 8 ) , 
CPl-therrao (8)Q50T,K91A. 

A DNA sequence which encodes a phytase comprising an amino 

15 acid sequence which is at least 93.80%, preferably at least 94, 
94.5, 95, 95,5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% 
identical to the sequence of amino acids 1-4 67 of consensus 
phytase 10 (FcplO) as shown in Fig. 5. 

20 Phytases and corresponding DNA sequences related to 

consensus phytase 10 thermoQ) Q50T, K91A 

A phytase which comprises an amino acid sequence which is 
at least 93.37%, preferably at least 93.5, 94, 94,5, 95, 95,5, 
96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% identical to the 

25 sequence of amino acids 1-467 of consensus phytase 10 thermo(3) 
Q50T, K91A as shown in Fig. 8. 

A phytase which comprises an amino acid sequence which is 
at least 94.66%, preferably at least 95.0, 95.5, 96, 96.5, 97, 
97.5, 98, 98.5, 99, 99.5% similar to the sequence of amino acids 

30 1-467 of consensus phytase 10 thermo{3) Q50T, K91A as shown in 
Fig, 8, 
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A phytase which is encoded by a DNA sequence which is at 
least 95.88, preferably at least 96, 96.5, 97, 97.5, 98, 98.5, 
99, 99.5% identical to nucleotides 12-1412 of the DNA sequence 
of consensus phytase 10 thermo(3) Q50T, K91A as shown in Fig. 8. 
5 A DNA sequence which encodes a phytase and which (i) is at 

least 95.88%, preferably at least 96, 96.5, 97, 97.5, 98, 98,5, 
99, 99.5% identical, or (ii) hybridizes under low, preferably 
medium, medium/high, high, very high stringency conditions to 
nucleotides 12-1412 of the DNA sequence of consensus phytase 10 

10 thermo(3) Q50T, K91A as shown in Fig, 8, A preferred negative 
control is DNA encoding consensus phytase. A preferred positive 
control is DNA encoding any of CPIO, CPlO-thermo (3) -Q50T, K91A, 
CPl-thermo (8) , CPl-thermo (8) Q50T, K91A. 

A DNA sequence which encodes a phytase comprising an amino 

15 acid sequence which is at least 93.37%, preferably at least 
93.5, 94, 94.5, 95, 95.5, 96, 96,5, 97, 97.5, 98, 98,5, 99, 
99-5% identical to the sequence of amino acids 1-4 67 of 
consensus phytase 10 thermo{3) Q50T, K91A as shown in Fig. 8, 

20 Phytases and corresponding DNA sequences related to 

consensus phytase l-thermo(8) 

A phytase which comprises an amino acid sequence which is 
at least 98.30%, preferably at least 98.5, 99, 99.5% identical 
to the sequence of amino acids 1-467 of consensus phytase 1 
25 thermo(8) (as shown in Fig. 7, backmutations T50Q,A91K to be 
added) . 

A phytase which comprises an amino acid sequence which is 
at least 98.51%, preferably at least 99, 99.5% similar to the 
sequence of amino acids 1-4 67 of consensus phytase 1 thermo(8) 
30 (as shown in Fig. 7, backmutations T50Q,A91K to be added) . 
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A phytase which is encoded by a DNA sequence which is at 
least 98.73, preferably at least 98.5, 99, 99.5% identical to 
nucleotides 1-1407 of the DNA sequence of consensus phytase 1 
thermo(8) (as shown in Fig. 7, backmutations T50Q,A91K to be 
5 added) . 

A DNA sequence which encodes a phytase and which (i) is at 
least 98.73, preferably at least 98-5, 99, 99.5% identical, or 
(ii) hybridizes under low, preferably medium, medium/high, high, 
very high stringency conditions to nucleotides 1-1407 of the DNA 

10 sequence of consensus phytase 1 thermo(8) (as shown in Fig. 1, 
backmutations T50Q,A91K to be added) , A preferred negative 
control is DNA encoding consensus phytase. A preferred positive 
control is DNA encoding any of CPl-thermo ( 8 ) , CPl- 
thermo (8) Q50T, K91A. 

15 A DNA sequence which encodes a phytase comprising an amino 

acid sequence which is at least 98.30%, preferably at least 
98.5, 99, 99.5% identical to the sequence of amino acids 1-467 
of consensus phytase 1 thermo(8) (as shown in Fig. 7, 
backmutations T50Q,A91K to be added) . 

20 

Phytases and corresponding DNA sequences related to 
consensus phytase 1 thermo(8) Q50T, K91A 

A phytase which comprises an amino acid sequence which is 
at least 97.87%, preferably at least 98, 98.5, 99, 99,5% 
25 identical to the sequence of amino acids 1-467 of consensus 
phytase 1 thermo(8) Q50T, K91A as shown in Fig. 7. 

A phytase which comprises an amino acid sequence which is 
at least 98.08%, preferably at least 98.5, 99, 99.5% similar to 
the sequence of amino acids 1-467 of consensus phytase 1 
30 thermo(8) Q50T, K91A as shown in Fig. 7. 
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A phytase which is encoded by a DNA sequence which is at 
least 98.37, preferably at least 98.5, 99, 99.5% identical to 
nucleotides 1-1407 of the DNA sequence of consensus phytase 1 
thermo(8) Q50T, K91A as shown in Fig. 7. 
5 A DNA sequence which encodes a phytase and which (i) is at 

least 98.37, preferably at least 98.5, 99, 99.5% identical, or 
(ii) hybridizes under low, preferably medium, medium/high, high, 
very high stringency conditions to nucleotides 1-1407 of the DNA 
sequence of consensus phytase 1 thermo(8) Q50T, K91A as shown in 

10 Fig. 7. A preferred negative control is DNA encoding consensus 
phytase. A preferred positive control is DNA encoding any of 
CPl-thermo(8) , CPl-thermo (8) Q50T, K91A. 

A DNA sequence which encodes a phytase comprising an amino 
acid sequence which is at least 97.87%, preferably at least 98, 

15 98.5, 99, 99.5% identical to the sequence of amino acids 1-467 
of consensus phytase 1 thermo(8) Q50T, K91A as shown in Fig. 7. 

Phytases and corresponding DNA sequences related to 
consensus phytase 11 

20 A phytase which comprises an amino acid sequence which is 

at least 90.71%, preferably at least 91. 91.5, 92, 92.5, 93, 
93.5, 94, 94.5, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 
99.5% identical to the sequence of amino acids 1-482 of 
consensus phytase 11 as shown in Fig. 6. 

25 A phytase which comprises an amino acid sequence which is 

at least 92.07%, preferably at least 92.5, 93, 93.5, 94, 94.5, 
95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% similar to the 
sequence of amino acids 1-482 of consensus phytase 11 as shown 
in Fig. 6. 

30 A DNA sequence which encodes a phytase comprising an amino 

acid sequence which is at least 90.71%, preferably at least 91. 
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91.5, 92, 92*5, 93, 93.5, 94, 94.5, 95, 95.5, 96, 96.5, 97, 
97.5, 98, 98.5, 99, 99.5% identical to the sequence of amino 
acids 1-482 of consensus phytase 11 as shown in Fig. 6. 

5 Phytases and corresponding DNA sequences related to A. 

fumigatus alpha-mutant 

A phytase which comprises an amino acid sequence which is 
at least 97,17%, preferably at least 97,5, 98, 98.5, 99, 99.5% 
identical to the sequence of amino acids 1-4 67 of A. fumigatus 

10 alpha-mutant (phytase) as shown in Fig. 9. 

A phytase which comprises an amino acid sequence which is 
at least 97.82%, preferably at least 98, 98.5, 99, 99-5% similar 
to the sequence of amino acids 1-467 of A, fumigatus alpha- 
mutant (phytase) as shown in Fig. 9. 

15 A phytase which is encoded by a DNA sequence which is at 

least 96.13%, preferably at least 96.5, 97, 97.5, 98, 98.5, 99, 
99.5% identical to nucleotides 1-1401 of the DNA sequence of A. 
fumigatus ATCC 1307 3 alpha-mutant shown in Fig. 9. 

A DNA sequence which encodes a phytase comprising an amino 

20 acid which is at least 97,17%, preferably at least 97.5, 98, 
98.5, 99, 99.5% identical to the sequence of amino acids 1-467 
of A. fumigatus ATCC 13073 alpha-mutant shown in Fig. 9, 

A DNA sequence which encodes a phytase and which (i) is at 
least 96.13%, preferably 96.5, 97, 97.5, 98, 98.5, 99, 99.5% 

25 identical, or (ii) hybridizes under low, preferably medium, 
medium/high, high, very high stringency conditions to 
nucleotides 1-1401 of the DNA sequence of A- fumigatus ATCC 
13073 alpha-mutant shown in Fig, 9, A preferred negative control 
is DNA encoding A. fumigatus 13073. A preferred positive control 

30 is DNA encoding any of A. fumigatus ATCC 13073 and its optimised 
alpha-mutant . 
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Phytases and corresponding DNA sequences related to the 
optimized A. fumigatus alpha-mutant 

A phytase which comprises an amino acid sequence which is 
5 at least 96.08%, preferably at least 96.5, 97, 97.5, 98, 98.5, 
99, 99.5% identical to the sequence of the phytase of the 
optimized A. fumigatus alpha-mutant. 

A phytase which comprises an amino acid sequence which is 
at least 96.74%, preferably at least 97, 97.5, 98, 98,5, 99, 
10 99.5% similar to the sequence of the phytase of the optimized A. 
fumigatus alpha-mutant . 

A phytase which is encoded by a DNA sequence which is at 
least 95,63%, preferably at least 96, 96.5, 97, 97.5, 98, 98.5, 
99, 99.5% identical to nucleotides 1-1401 of the DNA sequence 
15 encoding the phytase of the optimized A. fumigatus alpha-mutant. 

A DNA sequence which encodes a phytase comprising an amino 
acid which is at least 96.08%, preferably at least 96,5, 97, 
97.5, 98, 98.5, 99, 99.5% identical to the phytase of the 
optimized A. fumigatus alpha-mutant. 
20 A DNA sequence which encodes a phytase and which (i) is at 

least 95.63%, preferably at least 96, 96.5, 97, 97.5, 98, 98.5, 
99, 99.5% identical, or (ii) hybridizes under low, preferably 
medium, medium/high, high, very high stringency conditions to 
nucleotides 1-1401 of the DNA sequence encoding the phytase of 
25 the optimized A. fumigatus alpha-mutant. 

A preferred negative control is DNA encoding A, fumigatus 
13073. A preferred positive control is DNA encoding any of A. 
fumigatus ATCC 13073 and its optimised alpha-mutant. 

30 Phytases and corresponding DNA s equences related to 

consensus phytase 7 
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A phytase which comprises an amino acid sequence which is 
at least 94.87%, preferably at least 95, 95.5, 96, 96.5, 97, 
97.5, 98, 98.5, 99, 99.5% identical to the sequence of amino 
acids 1-467 of consensus phytase 7 as shown in Fig. 10- 
5 A phytase which comprises an amino acid sequence which is 

at least 95.30%, preferably at least 95.5, 96, 96.5, 97, 97.5, 
98, 98.5, 99, 99.5% similar to the sequence of amino acids 1-467 
of consensus phytase 7 as shown in Fig, 10- 

A phytase which is encoded by a DNA sequence which is at 
10 least 96.38%, preferably 96.5, 97, 97.5, 98, 98-5, 99, 99.5% 
identical to nucleotides 12-1412 of the DNA sequence of 
consensus phytase 7 shown in Fig. 10. 

A DNA sequence which encodes a phytase and which (i) is at 
least 96.38%, preferably at least 96.5, 97, 97.5, 98, 98-5, 99, 
15 99.5% identical, or (ii) hybridizes under low, preferably 
medium, medium/high, high, very high stringency conditions to 
nucleotides 12-1412 of the DNA sequence of consensus phytase 7 
as shown in Fig. 10. 

A DNA sequence which encodes a phytase comprising an amino 
20 acid sequence which is at least 94.87%, preferably at least 95, 
95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% identical to the 
sequence of amino acids 1-467 of consensus phytase 7 as shown in 
Fig. 10. 

25 Phytases related to basidio consensus 

A phytase which comprises an amino acid sequence which is 
at least 76.23%, preferably at least 77, 78, 79, 80, 81, 82, 83, 
84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 94.5, 95, 95.5, 96, 
96.5, 97, 97-5, 98, 98.5, 99, 99.5% identical to the combined 

30 sequence of (i) amino acids 1-441 of basidio consensus shown in 
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Fig. 3, and (ii) amino acids 1-26 shown in Fig. 5 (the sequence 
of (ii) to be added at the N-terminal of the sequence of (i) ) . 

A phytase which comprises an amino acid sequence which is 
at least 79.50%, preferably at least 80, 81, 82, 83, 84, 85, 86, 
5 87, 88, 89, 90, 91, 92, 93, 94, 95, 95.5, 96, 96.5, 97, 97.5, 

98, 98.5, 99, 99.5% similar to the sequence of amino acids 1-441 
of basidio consensus as shown in Fig. 3. 

Phytases related to consensus phytase 12 
10 A phytase which comprises an amino acid sequence which is 

at least 70, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 

88, 89, 90, 91, 92, 93, 94, 95, 95.5, 96, 96.5, 97, 97.5, 98, 
98-5,. 99, 99.5% identical to the sequence of amino acids 1-467 
of consensus phytase 12 as shown in Fig. 21. 

15 A phytase which comprises an amino acid sequence which is 

at least 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 

89, 90, 91, 92, 93, 94, 95, 95,5, 96, 96.5, 97, 97.5, 98, 98.5, 

99, 99.5% similar to the sequence of amino acids 1-467 of 
consensus phytase 12 as shown in Fig. 21. 

20 



en 

(U 

u 
c 

OJ 
OJ 

tn 

i 



S' 

•H 
TJ 
O 
U 
C 
OJ 

O) 
CO 
03 
4J 

^' 

Pi 
U-l 

o 
o 

cn 

-H 

CD 



O 



. f 8* 

< (8 — 



£1 



Oi 43 in 

o 4J a 



a. Jij 

O 4J 



S 5 H 



t 



01 

g 

to 



ID 

i 



5808.010/220999/KaPe 



71 

References : 

Akanuma, S., Yamagishi, A., Tanaka, N, & Oshima, T. 
(1998). Serial increase in the thermal stability of 3- 
isopropylmalate dehydrogenase from Bacillus subtilis by 
5 experimental evolution. Prot. Sci. 1, 698-705. 

Arase, A., Yomo,. T., Urabe, I., Hata, Y., Katsube, Y. & 
Okada, H. (1993). Stabilization of xylanase by random 
mutagenesis. FEES Lett. 316, 123-127, 

Berka, R. M., Rey, M. W., Brown, K. M. , Byun, T. & Klotz, 
10 A. V. (1998) , Molecular characterization and expression of a 
phytase gene from the thermophilic fungus Thermomyces 
lanuginosus. Appl. Environ. Microbiol. 64, 4423-4427. 

Blaber, M., Lindstrom, J. D., Gassner, N., Xu, J., Heinz, 
D. W. & Matthews, B. W. (1993) . Energetic cost and structural 
15 consequences of burying a hydroxyl group within the core of a 
protein determined from Ala^Ser and Val^Thr substitutions in T4 
lysozyme. Biochemistry 32, 11363-11373. 

Brugger, R., Mascarello, F., Augem, S-, van Loon, A. P. G. 
M. & Wyss, M. (1997) . Thermal denaturation of phytases and pH 
20 2.5 acid phosphatase studied by differential scanning 
calorimetry. In The Biochemistry of phytate and phytase (eds. 
Rasmussen, S.K; Raboy, V.; Dalb0ge, H. and Loewus, F. ; Kluwer 
Academic Publishers . 

Cosgrove, D.J. (1980) Inositol phosphates - their 
25 chemistry, biochemistry and physiology: studies in organic 
chemistry, chapter 4. Elsevier Scientific Publishing Company, 
Amsterdam, Oxford, New York. 

Devereux, J., Haeberli, P.& Smithies, O. (1984) A 
comprehensive set of sequence analysis programs for the VAX. 
30 Nucleic Acids Res. 12, 387-395. 



5808.010/220999/KaPe 



72 

Gellissen, G., Hollenberg, C. P., Janowicz, Z. A. (1994) 
Gene expression in methylotrophic yeasts . In: Smith, A. (ed. ) 
Gene expression in recombinant microorganisms- Dekker, New York, 
395-439. 

5 Gellissen, G., Piontek, M. , Dahlems, U., Jenzelewski, V. , 

Gavagan, J. E., DiCosimo, R., Anton, D. I. & Janowicz, Z. A. 
(1996) Recombinant Hansenula polymorpha as a biocatalyst: 
coexpression of the spinach glycolate oxidase (GO) and the S, 
cerevisiae catalase T (CTTl) gene. Appl . Microbiol. Biotechnol. 
10 46, 46-54. 

Gerber, P. and Muller, K. (1995) Moloc molecular modeling 
software. J. Comput. Aided Mol . Des. 9, 251-268 

Hinnen, A., Hicks, J. B. & Fink, G, R. (1978) 
Transformation of yeast. Proc. Natl. Acad. Sci. USA 75, 1929- 
15 1933. 

Imanaka, T., Shibazaki, M. & Takagi, M. (1986), A new way 
of enhancing the thermostability of proteases. Nature 324, 695- 
697. 

Janes, M. , Meyhack, B., Zimmermann, W. & Hinnen, A. (1990) 
20 The influence of GAP promoter variants on hirudine production, 
average plasmid copy number and cell growth in Saccharomyces 
cerevisiae. Curr. Genet- 18, 97-103. 

Karpusas, M., Baase, W. A., Matsumura, M, & Matthews, B. 
W. (1989) . Hydrophobic packing in T4 lysozyme probed by cavity- 
25 filling mutants. Proc. Natl. Acad. Sci. (USA) 86, 8237-8241. 

Margarit, I., Campagnoli, S., Frigerio, F. , Grandi, G., 
Fillipis, V. D. & Fontana, A. (1992) . Cumulative stabilizing 
effects of glycine to alanine substitutions in Bacillus subtilis 
neutral protease. Prot . Eng. 5, 543-550. 
30 Matthews, B. W. (1987a) . Genetic and structural analysis 

of the protein stability problem. Biochemistry 26, 6885-6888. 



5808 . 010/220999/KaPe 



73 

Matthews, B. W. (1993) . Structural and genetic analysis of 
protein stability. Annu. Rev. Biochem. 62, 139-160. 

Matthews, B. W., Nicholson, H. & Becktel, W. (1987). 
Enhanced protein thermostability from site-directed mutations 
5 that decrease the entropy of unfolding. Proc. Natl. Acad. Sci. 
(USA) 84, 6663-6667, 

Mitchell, D. B., Vogel, K. , Weimann, B. J., Pasamontes, L. 

s 

& van Loon, A, P. G. M. (1997) The phytase subfamily of 
histidine acid phosphatases: isolation of genes for two novel 

10 phytases from the fungi Aspergillus terreus and Myceliophthora 
thermophila. Microbiology 143, 245-252. 

Mullaney, E. J. , Hamer, J. E., Roberti, K. A., Yelton, M. 
M. & Timberlake, W. E. (1985) Primary structure of the trpC gene 
from Aspergillus nidulans. Mol . Gen. Genet. 199, 37-46. 

15 Munoz, V. & Serrano, L. (1995). Helix design, prediction 

and stability. Curr. Opin. Biotechnol. 6, 382-386, 

Pace, N* C. , Vajdos, F. , Fee, L. , Grimsley, G. & Gray, T. 
(1995) . How to measure and predict the molar absorption 
coefficient of a protein. Prot. Sci, 4, 2411-2423, 

20 Pantoliano, M. W. , Landner, R. C., Brian, P. N., Rollence, 

M. L., Wood, J. F. & Poulos, T, L. (1987). Protein engineering 
of subtilisin BPN': enhanced stabilization through the 
introduction of two cysteines to form a disulfide bond. 
Biochemistry 26, 2077-2082. 

25 Pasamontes, L. , Haiker, M., Henriquez-Huecas, M., 

Mitchell, D, B. & van Loon, A. P. G, M. (1997a) . Cloning of the 
phytases from Emericella nidulans and the thermophilic fungus 
Talaromyces thermophilus . Biochim. Biophys. Acta 1353, 217-223. 

Pasamontes, L., Haiker, M., Wyss, M., Tessier, M. & van 

30 Loon, A. P. G. M. (1997) Cloning, purification and 



5808.010/220999/KaPe 



74 

characterization of a heat stable phytase from the fungus 
Aspergillus fumigatus, Appl , Environ. Microbiol. 63, 1696-1700. 

Piddington, C. S., Houston, C. S., Paloheimo, M., 
Cantrell, M., Miettinen-Oinonen, A. Nevalainen, H., & Rambosek, 
5 J. (1993) The cloning and sequencing of the genes encoding 
phytase (phy) and pH 2.5-optiinum acid phosphatase (aph) from 
Aspergillus niger var. awamori. Gene 133, 55-62. 

Purvis, I. J. / Bettany, A. J- E. , Santiago, T. C. , 
Coggins, J. R. , Duncan, K., Eason, R. & Brown, A. J. P. (1987). 
10 The efficiency of folding of some proteins is increased by 
controlled rates of translation in vivo. J. Mol . Biol. 193, 413- 
417. 

Risse, B., Stempfer, G., Rudolph, R., Schumacher, G- & 
Jaenicke, R. (1992) . Characterization of the stability effect of 
15 point mutations of pyruvate oxidase from Lactobacillus 
plantarum: protection of the native state by modulating coenzyme 
binding and subunit interaction. Prot. Sci. 1, 1710-1718. 

Sambrook, J., Fritsch, E. F. & Maniatis, T. (1989) 
Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring 
20 Harbor Laboratory, Cold Spring Harbor, NY. 

Sauer, R. , Hehir, K. , Stearman, R. , Weiss, M. , Jeitler- 
Nilsson, A., Suchanek, E. & Pabo, C. (1986). An engineered 
intersubunit disulfide enhances the stability and DNA binding of 
the N-terminal domain of 1-repressor, Biochemistry 25, 5992- 
25 5999. 

Serrano, L., Day, A. G. & Fersht, A. R. (1993). Step-wise 
mutation of barnase to binase. A procedure for engineering 
increased stability of proteins and an experimental analysis of 
the evolution of protein stability. J. Mol, Biol. 233, 305-312. 



5808 . 010/220999/KaPe 



75 

Sheman, J. P., Finck, G. R. & Hicks, J. B. (1986) 
Laboratory course manual for methods in yeast genetics- Cold 
Spring Harbor University, 

Steipe, B., Schiller, B., Plueckthun, A. & Steinbach, S. 
5 (1994) . Sequence statistics reliably predict stabilizing 
mutations in a protein domain, J. Mol. Biol. 240, 188-192. 

van den Burg, B., Vriend, G., Veltman, O, R. , Venema & G., 
Eijsink, V. G. H. (1998) . Engineering an enzyme to resist 
boiling. Proc. Natl. Acad. Sci. (USA) 95, 2056-2060. 
10 Van Etten, R.L. (1982) Human prostatic acid phosphatase: a 

histidine phosphatase. Ann. NY Acad. Sci. 390,27-50 

van Hartingsveldt, W., van Zeijl, C. M. F. , Harteveld, G. 
M., Gouka, R. J., Suykerbuyk, M. E. G., Luiten, R. G. M. , van 
Paridon, P. A., Selten^ G. C. M., Veenstra, A. E., van Gorcom, 
15 R. F. M., & van den Hondel, C. A. M. J. J. (1993) Cloning, 
characterization and overexpression of the phytase-encoding gene 
(phyA) of Aspergillus niger. Gene 127, 87-94. 

van Loon, A. P. G. M. , Simoes-Nunes, C, Wyss, M., 
Tomschy, A., Hug, D., Vogel, K. & Pasamontes, L. (1998). A heat 
20 resistant phytase of Aspergillus fumigatus with superior 
performance in animal experiments. Phytase optimization and 
natural variability. In the Biochemistry of phytate and 
phytases. Kluwer Academic Press, s.a. 



5808 . 010/220999/KaPe 



76 



SEQUENCE LISTING 
to follow! 



5808 . 010/220999/KaPe 



77 

CLAIMS 

1. A phytase which comprises an amino acid sequence which is at 
least 93.80% identical to the sequence of amino acids 1-4 67 of 

5 consensus phytase 10 (FcplO) as shown in Fig. 5. 

2. A phytase which is encoded by a DNA sequence which is at 
least 95.88% identical to nucleotides 12-1412 of the DNA 
sequence of consensus phytase 10 (FcplO) as shown in Fig. 5. 

10 

3. A phytase which comprises an amino acid sequence selected 
from amongst the amino acid sequence of Fig. 5, and the amino 
acid sequence designated FcplO shown in Fig. 4 (consensus 
phytase 10) . 

15 

4 . A phytase which comprises an amino acid sequence selected 
from amongst the amino acid sequences of (i) consensus phytase- 
10-thermo[3] (also designated consensus phytase-lO-thermo) ; (ii) 
the variant Q50T, (iii) K91A, or (iv) (Q50T+K91A) thereof - 
20 variant (iv) is shown in Fig. 8; and (v) amino acids 27-467 of 
any of the sequences (i) to (iv) . 

5. A phytase which comprises an amino acid sequence selected 
from amongst the amino acid sequences of (i) consensus phytase- 
25 l-thermo[8] (also designated consensus phytase-l-thermo) ; (ii) 
the variant Q50T, (iii) K91A, or (iv) (Q50T+K91A) thereof - 
variant (iv) is shown in Fig. 7; and (v) amino acids 27-4 67 of 
any of the sequences of (i) to (iv) . 

30 6. A phytase which comprises the amino acid sequence of 
consensus phytase-11 shown in Fig. 6. 
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7. A DNA sequence which encodes the phytase of any one of 
claims 1-6- 

5 8, A DNA sequence which encodes a phytase, and which is (i) at 
least 95.88% identical, or (ii) hybridizes under high stringency 
conditions, to nucleotides 12-1412 of the DNA sequence of 
consensus phytase 10 (FcplO) as shown in Fig. 5. 

10 9. A DNA sequence which encodes a phytase comprising an amino 
acid sequence which is at least 93.80% identical to the sequence 
of amino acids 1-467 of consensus phytase 10 (FcplO) as shown in 
Fig. 5. 

15 10. A DNA sequence which encodes a phytase, and which comprises 

(i) nucleotides 1-1426, 12-1412, 90-1426 or 90-1412 of 
the DNA sequence encoding consensus phytase 10 (Fcp 10) shown 
in Fig. 5; 

(ii) nucleotides 1-1404, 1-1401, 79-1404, or 79-1401 of 
20 the DNA sequence encoding consensus phytase-lO-thermo [3] -Q50T, 

K91A shown in Fig. 8; 

(iii) variants of the nucleotides of (ii) which encodes 
variants (i)-(iii) of claim 4; 

(iv) nucleotides 1-1410, 1-1407, 79-1410 or 79-1407 of 
25 the DNA sequence encoding consensus phytase-l-thermo [ 8 ] -Q50T, 

K91A as shown in Fig. 7; 

(v) variants of the nucleotides of (iv) which encodes 
variants (i)-(iii) of claim 5; or 

30 11, A vector comprising the DNA sequence according to any one 
of claims 3-5. 
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12. A host cell comprising the DNA sequence according to any 
one of claims 3-5 or the vector according to claim 6. 

5 13. A process for producing a phytase, the process comprising 
culturing the host cell according to claim 7 under conditions 
permitting the production of the phytase, and recovering the 
phytase from the culture broth. 

10 14. A food, feed or pharmaceutical composition comprising the 
phytase of any one of claims 1-2. 
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ABSTRACT 

This invention relates to improved phytases, preferably 
phytases of an increased thermostability, and a process of 
producing them based on a comparison with consensus phytases. In 
5 particular, stabilizing amino acid mutations are introduced into 
a homologous protein, or the active site of a phytase is 
replaced in part or in toto. The corresponding DNA sequences 
and methods of preparing it is also disclosed, as are methods of 
producing the improved phytases, and the use thereof. Specific 
10 variants of Aspergillus fumigatus phytase and consensus phytases 
are disclosed. 
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A. terreus 9A-1 KhsDCNSVDh GYQCFPELSH kWGlYAPYFS 

A. terreus cbs NhsDCTSVDr GYQCFPELSH kWGlYAPYFS 
A. nlger var. awamori NqsTCDTVDQ GYQCFSETSH LWGQYAPFFS 

A. niger T213 NqsSCDTVDQ GYQCFSETSH LWGQYAPFFS 

A. niger NRRL3135 NqsSCDTVDQ GYQCFSETSH LWGQYAPFFS 

A. fumigatus 13073 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS 

A. fujrigatus 32722 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS 

A. fumig-atus 58128 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS 

A. fumigatus 26906 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS 

A. fumigatus 32239 GSkACDTVEl GYQCsPGTSH LWGQYSPFFS 

E, nidulans QNHSCNTADG GYQCFPNVSH VWGQYSPYFS 

T. thermophilus DSHSCNTVEG GYQCrPEISH sWGQYSPFFS 

Af. thennophila ESRPCDTpDl GFQCgTAISH FWGQYSPYFS 

Consensus 
Consensus phytase 



LQDESPFPID 
LQDESPFPID 
LANESAISPD 
LANESVISPD 
LANESVISPE 
LEDElSVSSK 
LEDEISVSSK 
LEDElSVSSK 
LEDEISVSSK 
LEDEISVSSD 
lEQESAISeD 
LADQSEISPD 
VpSElDaS. . 



50 

VPEDChlTFV 
VPDDChlTFV 
VPAGCrVTFA 
VPAGCrVTFA 
VPAGCrVTFA 
LPKDCrlTLV 
LPKDCrlTLV 
LPKDCrlTLV 
LPKDCrlTLV 
LPKDCrVTFV 
VPHGCeVTFV 
VPQNCklTFV 
IPDDCeVTPA 



NSHSCDTVDG GYQCFPEISH LWGQYSPYFS LEDESAISPD VPDDC-VTFV 
NSHSCDTVDG GYQCFPEISH LWGQYSPYPS LEDESAISPD VPDDCRVTFV 
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A. terreus 9A-1 QVLARHGARs PThSKtKAYA AtlAAIQKSA TaFpGKYAFL QSYNYSLDSE 

A. terreus cbs QVLARHGARs PTDSKtKAYA AtlAAIQKNA TaLpGKYAFL KSYNYSMGSE 
A. niger var. awamori QVLSRHGARY PTESKgKkYS ALIEEIQQNV TtFDGKYAFL KTYNYSLGAD 

A. niger T213 QVLSRHGARY PTESKgKkYS ALIEEIQQNV TtFDGKYAFL KTYNYSLGAD 

A. niger NRRL3135 QVLSRHGARY PTDSKgKkYS ALIEEIQQNA TtFDGKYAFL KTYNYSLGAD 

A. fumigatus 13073 QVLSRHGARY PTSSKsKkYK kLVTAIQaNA TdFKGKFAPL KTYNYTLGAD 

A. fumigatus 32722 QVLSRHGARY PTSSKsKkYK kLVTAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 58128 QVLSRHGARY PTSSKsKkYK kLVTAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 26906 QVLSRHGARY PTSSKsKkYK kLVTAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32239 QVLSRHGARY PTASKsKkYK kLVTAIQKNA TeFKGKFAFL ETYNYTLGAD 

K. nidulans QVLSRHGARY PTESKsKAYS GLIEAIQKNA TsFwGQYAFL ESYNYTLGAD 

T. thermophilus QLLSRHGARY PTSSKtElYS QLISrIQKTA TaYKGyYAFL KDYrYqLGAN 

M. thermophila QVLSRHGARa PTlKRaaSYv DLIDrlHhGA IsYgPgYEFL RTYDYTLGAD 

Consensus QVLSRHGARY PTSSK-KAYS ALIEAIQKNA T-FKGKYAFL KTYNYTLGAD 

Consensus phytase QVLSRHGARY PTSSKSKAYS ALIBAIQKNA TAFKGKYAFL KTYNYTLGAD 
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A. terz-eus 9A-1 ELTPFGrNQL rDlGaQFYeR YNALTRhlnP 

A. terreus cbs NLTPFGrNQL qDlGaQFYRR YDTLTRhlnP 

A. niger var, awamori DLTPFGEQEL VNSGIKPYQR YESLTRNIIP 

A. niger T213 DLTPFGEQEL VNSGIKPYQR YESLTRNIIP 

A. niger NRRL3135 DLTPFGEQEL VNSGIKPYQR YESLTRNIVP 

A. fumigatus 13073 DLTPFGEQQL VNSGIKPYQR YKALMSWP 

A. fumigatus 32722 DLTPFGEQQL VNSGIKPYQR YKALARSWP 

A. fumigatus 58128 DLTPFGEQQL VNSGIKPYQR YKALARSWP 

A. fumigatus 26906 DLTAFGEQQL VNSGIKPYQR YKALARSWP 

A, fumigatus 32239 DLTPFGEQQM VNSGIKPYQK YKALAgSWP 

E. nidulans DLTiFGENQM VDSGaKFYRR YKNLARKnTP 

T. thermophilus DLTPPGENQM IQlGIKFYnH YKSLARNaVP 

M. thermophila ELTRtGQQQM VNSGIKPYRR YRALARKsIP 



FVRATDASRV 
FVRAADSSRV 
FIRSSGSSRV 
FIRSSGSSRV 
FIRSSGSSRV 
FIRASGSDRV 
FIRASGSDRV 
FIRASGSDRV 
FIRASGSDRV 
FIRSSGSDRV 
FIRASGSDRV 
FVRCSGSDRV 
FVRTAGqDRV 
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hESAEKFVEG 
hESAEKFVEG 
lASGEKFIEG 
lASGEKFIEG 
lASGKKFIEG 
lASGEKFIEG 
lASGEKFIEG 
lASGEKFIEG 
lASGEKFIEG 
lASGEKFIEG 
VASAEKPING 
lASGrlFIEG 
VhSAENFTQG 



Consensus DLTPPGENQM VNSGIKPYRR YKALARK-VP FVRASGSDRV lASAEKFIEG 

Consensus phytase DLTPPGENQM VNSGIKFYRR YKALARKXVP FIRASGSDRV lASAEKriEG 
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A. terreuB 9A-1 

A. terreus cbs 

A. jilger var. awamori 

A. niger T213 

A. ijlger NRRli3i35 

A. fuiTiigatus 13073 

A. Tuini^atus 32722 

A. fxmigatus 58128 

A. fu/ni^atus 26906 

A. fxmigatus 32239 

E, nidulans 

r. thermophilua 

M. thermophi 1 a 



151 

FQTARqDDHh 
FQNARqGDPh 
FQSTKLkDPr 
FQSTKLkDPr 
FQSTKLkDPr 
FQqAKLADPG 
FQqAKLADPG 
FQqAKLADPG 
FQqAKLADPG 
FQqANVADPG 
FRKAQLhDHG 
FQSAKVlDPh 
FHSAILADRG 



ANpHQPSPrV 
ANpHQPSPrV 
AqpgQSSPkl 
AqpgQSSPkl 
AqpgQSSPkl 
A.TWRAAPAI 
A.TNRAAPAI 
A.TNRAAPAI 
A.TNRAAPAI 
A.TNRAAPVI 
S. .gQATPW 
SDkHDAPPTI 
STvRPTlPyd 



DValPEGSAY 
DWIPEGTAY 
DWISEASSs 
DWISEASSs 
DWISEASSs 
SVIIPESETF 
SVIIPESETF 
SVIIPESETF 
SVIIPESETF 
SVIIPESETY 
NVIIPEiDGF 
NVIIeEGPSY 
mWIPETAGa 



NNTLEHSlCT 
NNTLEHSICT 
NNTLDPGTCT 
NNTLDPGTCT 
NNTLDPGTCT 
NNTLDHGVCT 
NNTLDHGVCT 
NNTLDHGVCT 
NNTLDHGVCT 
NNTLDHSVCT 
NNTLDHSTCV 
NNTLDtGSCP 
NNTLHNDICT 



200 

AFES . . , STV 



AFEA. 
VFED. 
VFED. 
VFED. 
kFEA. , 
kFEA. . 



.STV 
.SEL 
.SEL 
.SEL 
.SQL 
.SQL 



kFEA. , .SQL 
kFEA. . .SQL 
NFEA. . .SEL 
SFEN. . .DEr 
VFED . . . SSg 
AFEEgpySTX 



Consensus 
Consensus phytase 



FQSAKLADPG S-PHQASPVI NVIIPEGSGY NNTLDHGTCT AFED---SEL 
FQSAKLADPG SQPHQASPVl DVIIPEGSGY NNTLDHGTCT APED... SEL 



A. terreus 9A-1 GDDAvANFTA VFAPAIaQRL EADLPGVqLS TDDWnLMAM CPFETVSITD 

A. terreus cbs GDAAADNFTA VFAPAIakRL EADLPGVqLS ADDWnLMAM CPFETVSITD 
A. niger var. awamori ADTVEANFTA TFAPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST 

A. niger T213 ADTVEANFTA TFAPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST 

A. niger NRRL3135 ADTVEANFTA TFVPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST 

A. fumigatus 13073 GDEVAANFTA iFAPDIRARa EkHLPGVTLT DEDWsLMDM CSFDTVARTS 

A. fumigatus 32722 GDEVAANFTA IFAPDIRARa EkHLPGVTLT DEDWsLMDM CSFDTVARTS 

A. fumigatus 58128 GDEVAANFTA IFAPDIRARa EkHLPGVTLT DEDWsLMDM CSFDTVARTS 

A. fumigatus 26906 GDEVAANFTA IFAPDIRARa KkHLPGVTLT DEDWsLMDM CSFDTVARTS 

A. fumigatus 32239 GDEVEANFTA IFAPAIRARI EkHLPGVqLT DDDWsLHDM CSFDTVARTA 

B. nidulans ADEiEANFTA IMGPPIRkRL ENDLPGIKLT NENVIyLMDM CSFDTMARTA 
T, t/jermophilus GHDAQEKFAk qFAPAIlEKI KDHLPGVDLA vSDVpyLMDL CPFETLARNh 
M, thermophila GDDAQDTYlS TFAGPItARV NANLPGANLT DADTVaLMDL CPFETVAsSS 

Consensus GDDAEANFTA TFAPAIRARL EADLPGVTLT DEDW-LMDM CPFETVARTS 

Consensus phytase GDDVEANFTA LFAPAIRARL EADLPGVTLT DBDWYLHDM CPFETVARTS 



251 300 

A. terreus 9A-1 DAhTLSPFC DLFTAtEWtq YNYLlSLDKY YGYGGGNPLG 

A. terreus cbs DAhTLSPFC DLFTAaEWtq YNYLlSLDKY YGYGGGNPLG 

A. niger var. awamori vDTKLSPFC DLFTHdEWih YDYLQSLkKY YGHGAGNPLG 

A. niger T213 vDTKLSPFC DLFTHdEWih YDYLRSLkKY YGHGAGNPLG 

A. niger NRRL3135 vDTKLSPFC DLFTHdEWin YDYLQSLkKY YGHGAGNPLG 

A. fumigatus 13073 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 32722 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 58128 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 26906 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 32239 DASELSPFC AIFTHnEWkk YDYLQSLGKY YGYGAGNPLG 

B, nidulans HGTELSPFC AIFTEkEWlq YDYLQSLSKY YGYGAGSPLG 

T. thermophilus TDT.LSPPC ALsTQeEWqa YDYYQSLGKY YGnGGGNPLG 

AT. thermophila sdpatadagg gNGrpLSPFC rLFSEsEWra YDYLQSVGKW YGYGPGNPLG 

Consensus -DATELSPFC ALFTE-EW-- YDYLQSLGKY YGYGAGNPLG 

Consensus phytase DATELSPFC ALFTHDEWRQ YDYLQSLGKY YGYGAGNPLG 
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A. terreus 9A-1 

A, terreus cbs 

A. niger var. awamori 

A. niger T213 

A. ni^er NRRL3135 

A. fumigatus 13073 

A. fumigratus 32722 

A. fumigatuB 58126 

A. ftunlgatus 26906 

A. fumigatus 32239 

nidulans 

T , thermoph ilua 

M. thermophila 

Consensus 
Consensus phytase 



301 

PVQGVGWaNE 
PVQGVGWaNE 
PTQGVGYaNE 
PTOGVGYaNE 
PTQGVGYaNE 
PAQGIGFtNE 
PAQGIGFtNE 
PAQGIGFtNE 
PAQGIGFtNE 
PAQGIGFtNE 
PAQGIGFtNE 
PAQGVGPvNE 
PTQGVGFvNE 



LMARLTRAPV 
LIARLTRSPV 
LIARLTHSPV 
LIARLTHSPV 
LIARLTHSPV 
LIARLTRSPV 
LIARLTRSPV 
LIARLTRSPV 
LIARLTRSPV 
LIARLTNSPV 
LIARLTQSPV 
LIARMTHSPV 
LLARLAgvPV 



HDHTCVNNTL 
HDHTCVNNTL 
HDDTSSNHTL 
HDDTSSNHTL 
HDDTSSNHTL 
QDHTSTNsTL 
QDHTSTNsTL 
QDHTSTNsTL 
QDHTSTNsTL 
QDHTSTNsTL 
QDNTSTNHTL 
QDYTTVNHTL 
RD9TSTNRTL 



DASPATFPLN 
DANPATFPLN 
DSNPATPPLN 
DSNPATFPLN 
DSSPATPPLN 
vSNPATFPLN 
vSNPATFPLN 
vSNPATFPLN 
vSNPATFPLN 
DSDPATFPLN 
DSNPATFPLD 
DSNPATFPLN 
DGDPrTFPLG 



350 

ATLYADFSHD 
ATLYADFSHD 
STLYADPSHD 
STLYADFSHD 
STLYADFSHD 
ATMYVDPSHD 
ATMYVDFSHD 
ATMYVDPSHD 
ATMYVDFSHD 
ATIYVDFSHD 
rKLYADFSHD 
ATLYADFSHD 
rPLYADFSHD 



PAQGVGF-NE LIARLTHSPV QDHTSTNHTL DSNPATFPLN ATLYADFSHD 
PAQC3V0PANE LIARLTRSPV QDHTSTNHTL DSNPATFPLN ATLYADFSHD 



A . terreua 9A- 1 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL313S 

A. fumigatus 13073 

A. fumigatus 32722 

A. fumigatus 58128 

A. fumigatus 26906 

A. fumigatus 32239 

E, nidulans 

T. thermophilus 

M . thermophila 



351 

SNLVSIFWAL 
SNLVSIFWAL 
NGIISILFAL 
NGIISILFAL 
NGIISILFAL 
NSMVSIFFAL 
NSMVSIFFAL 
NSMVSIFFAL 
NSMVSIFFAL 
NGMIPIFFAM 
NSMISIFFAM 
NTMTSIFaAL 
NDMMGVL9AL 



GLYNGTAPLS 
GLYNGTkPLS 
GLYNGTkPLS 
GLYNGTkPLS 
GLYNGTkPLS 
GLYNGTEPLS 
GLYNGTGPLS 
GLYNGTEPLS 
GLYNGTEPLS 
GLYNGTEPLS 
GLYNGTQPLS 
GLYNGTAkLS 
GaYDGVPPLD 



qTSVESVSQT 
qTTVEDITrT 
TTTVENITQT 
TTTVENITQT 
TTTVENITQT 
rTSVESaKEl 
rTSVESaKEl 
rTSVESaKEl 
rTSVESaKEl 
qTSeESTKES 
tnDSVESIQEm 
TTEIKSIEET 
KTArrDpEEl 



DGYAAAWTVP 
DGYAAAWTVP 
DGFSSAWTVP 
DGPSSAWTVP 
DGFSSAWTVP 
DGYSASWWP 
DGYSASWWP 
DGYSASWWP 
DGYSASWWP 
NGYSASWAVP 
DGYAASWTVP 
DGYSAAWTVP 
GGYAASWAVP 



400 

FAARAYVEMM 
FAARAYIEMM 
PASRlYVEMM 
FASRIYVEMM 
FASRIYVEMM 
FGARAYFEtM 
FGARAYFEtM 
FGARAYFEtM 
FGARAYFEtM 
FGARAYFEtM 
FGARAYFELM 
FGGRAYIEMM 
FAARiYVEKM 



Consensus 
Consensus phytase 



NSMISIFPAL GLYNGTAPLS TTSVESIEET DGYAASWTVP FGARAYVEMM 
NSMISIPFAL GLYNGTAPLS TTSVBSIBBT DOYSASWTVP FGARAYVEMM 







401 








450 


A. 


terreus 9A-1 






PLVRVLVNDR 


VMPLHGCPTD 


KLGRCKrDAF 


A. 


ter-reus cbs 






PLVRVLVNDR 


VMPLHGCAVD 


NLGRCKrDDP 


A. 


niger var. awamori 


QC 




PLVRVLVNDR 


WPLHGCPID 


aLGRCTrDSF 


A. 


niger T213 






PLVRVLVNDR 


WPLHGCPID 


aLGRCTrDSF 


A. 


niger NRRL3135 


QC 




PLVRVLVNDR 


WPLHGCPVD 


aLGRCTrDSF 


A. 


fumigatus 13 073 






PLVRALINDR 


WPLHGCDVD 


KLGRCKLNDF 


A. 


fumigatus 32722 






PLVRALINDR 


WPLHGCDVD 


KLGRCKLNDF 


A. 


fumigatus 5812 B 


QC 




SLVRALINDR 


WPLHGCDVD 


KLGRCKLNDF 


A. 


fumigatus 26906 






PLVRALINDR 


WPLHGCDVD 


KLGRCKLNDF 


A. 


fumigatus 32239 






PLVRALINDR 


WPLHGCAVD 


KLGRCKLKDF 


E. 


nidulans 






PLVRVLVNDR 


WPLHGCAVD 


KFGRCTLDDW 


T. 


th ermoph ilus 






PWRVLVNDR 


WPLHGCEVD 


SLGRCKrDDF 


M, 


thermophila 


RCsggggggg 


ggegrQEKDE 


eMVRVLVNDR 


VMTLkGCGAD 


ErGMCTLErF 



Consensus QC ---QAEKE PLVRVLVNDR WPLHGCAVD KLGRCKLDDF 

Consensus phytase QC QAEKE PLVRVLVNDR WPLHGCAVD KLGRCKRDDP 
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A. terreus 9A-1 
A. terreus cbs 
A. ni^er var. awamori 
A, niger T213 
A. niger NKRL3135 
A. fimigatus 13073 
A. fuinigatus 32722 
A. fvmigatus 58128 
A. fimigatus 26906 
A. fumigatus 32239 
E, nidulans 
T. thermophilus 
M . th ermophi 1 a 

Consensus 
Consensus phytase 



451 471 

VAGLSFAQAG GNWADCF - 

VEGLSFARAG NWAECF 

VrGLSFARSG GDWAECsA — - 
VrGLSFARSG GDWAECFA — - 
VrGLSFARSG DWAECFA — 
VKGLSWARSG GNWGECFS — - 
VKGLSWARSG GNWGECFS — - 
VKGLSWARSG GNWGECFS — - 
VKGLSWARSG GNWGECFS — - 
VKGLSWARSG NSEQSFS — 
VEGLNFARSG GNWkTCFTl- - 
VrGLSFARqG GNWEGCYAas e 
lESMAFARGN GKWDICFA— - - 

VEGLSFARSG GNWAECFA-- - 
VBGLSFARSG GNWAECFA. . . 
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Figvire 2 



CP-l 

Eco RIMGVFVVLLS lATLFGST 
TATATGAATTCAISGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

1 + + + + + ^ gQ 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTAIiGPRGNSHSCDTVDGG 

CATCCGGTACCGCCTTGGOTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 
61 + + + + + ^ 



GTAGGCCATGGCOGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3 

YQCFPEISHIiWGQYSPYFSL 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATACTTCTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 

EDESAISPDVPDDCRVTFVQ 
TGGAAGACGAATCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 

181 + + + + + + 240 

ACCTTCTGCTTAGACOATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 
CP-4 

CP-5 

VIiSRHGARYPTSSKSKAYSA 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTG 

241 + + + + + + 300 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGAC 

LIEAIQKNATAFKQKYAFIiK 
CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 -t- + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP- 6 

CP-7 

TYNYTLGADDLTPFGENQMV 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 

361 + ' + + + + + 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

NSGIKFYRRYKALARKIVPF 
TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTQGCTAGAAAGATTGTTCCAT 

421 + + + + + + 480 

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-B 

CP- 9 

IRASGSDRVIASAEKFIEGF 
TCATTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 

481 + + + + + + 540 

AGTAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGSQPHQASPVID 
TCCAATCTGCTAAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG 

541 + + + + ■ + + 600 

AGGTTAGACGATTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAAC 
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CP- 10 

CP-11 

VI IPEGSGYNNTLDHGTCTA 

ACGTTATTATTCCAGAAGGaTCcGGTTACAACAACACTTTGGACCACGGTACTTGTACTG 
601 + + ^ ^ ^ ^ 

TGCAATAATAAGGTCrrTCCtAQgCCAATGTTGTTGTGAAACCTGGTGCCATGAACATGAC 

^ EDSELGDDVEANFTALFAP 

CTTTCGAAGACTCTGAATTGGGTGACGACOTTGAAGCTAACTTCACTGCTTTGTTCGCTC 

661 + + + + + ^ 

GAAAGCTTCTGAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP-12 

AIRARLEADLPGVTLTDEDV 
CAOCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACG 
721 + + + + + + 780 

GTCOATAATCTCGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGC 

CP-13 

VYLMDMCPFETVARTSDATE 
TTGTTTACTTGATGGACATOTGTCCATTCGAAACTGTTGCTAGAACTTCTGACGCTACTG 

781 + + + + + ^ 840 

AACAAATGAACTACCTGTACACAGGTAAGCTTTGACAACGATCTTGAAGACTGCGATGAC 

LSPFCALFTHDEWRQYDYLQ 
AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGACAATACGACTACTTGC 
841 + + + + + + 900 

TTAACAGA6GTAAGACACGAAACAAGTGAGTGCTGCTTACCTCTGTTATGCTGATGAACG 
CP-14 

CP-15 

SLGKYYGYGAGNPLGPAQGV 

AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTG 

901 + ^ ^ ^ 

TTAGAAACCCATTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 

GFANELIARLTRSPVQDHTS 
TTGGTTTCGCTAACGAATTGATTGCTAGATTOACTAGATCTCCAGTTCAAGACCACACTT 

961 + + + + + ^ 1020 

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 

CP-16 

CP-17 

TNHTLDSNPATFPLNATLYA 
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 
1021 + + + + + ^ 1080 

GATGATTGGTGTGAAACCTGAGATTGOGTCdATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNSMTSIFFALGLYNG 
CTGACTTCTCTCACGACAACTCTATGATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

1081 + + + - -- + + ^ 1140 

GACTGAAGAGAGTGCTGTTGAGATACTAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18 

CP-19 

TAPLSTTSVESIEETDGYSA 
GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTG 
1141 + + + + + + 1200 

CATQACGAGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGAC 
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SWTVPFGARAYVEMMQCQAE 
CTTCTTGGACTOTTCCATTCGGTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTG 



1201 



+ 1260 



GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACAGTTCGAC 

CP-20 

CP-21 

KEPLVRVLVNDRVVPLHGCA 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 

1261 + + + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 



VDKLGRCKRDDFVEGLS 



F 



R 



CTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 



1321 



+ 



+ 1380 



GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 



CP-22 



SGGNWAECFA* Eco RI 
GATCTGGTGGTAACTGGGCTGAATGTTTCGCTTAAGAATTCATATA 



1381 




CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 
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Figure 3 

1 50 
p. involutus {phyAl) SvP.KnTAPt FPIPeseQrn WSPYSPYFPL AeYkAPPAGC QlnQVNIIQR 
P. involutus {phyA2) SvP.RniAPK FSIPeseQrn WSPYSPYFPL AeYkAPPAGC EInQVNIIQR 
T. pubescens hiPlRdTSAc LdVTrDvQqs WSmYSPYFPa AtYvAPPASC QlnQVHIIQR 

A. pediades GgwQaTfvQ pfFPpQiQds WAAYTPYYPV qaYtPPPkDC KItQVNIIQR 

P, lycii StQfsfvAAQ LPTPaQntsn WGPYdPFFPV EpYaAPPEGC tVtQVNLIQR 

Basldio S-P-R-TAAQ LPIP-Q-Q-- WSPYSPYFPV A-Y-APPAGC QI-QVNIIQR 



51 100 

P. involutus {phyAl) HGARFPTSGA TTRIKAGLTK LQGvqnfTDA KFNFIkSfkY dLGnsDLVPF 

P. involutus {phyA2) HGARFPTSGA ATRIKAGLSK LQSvqnfTDP KFDFIkSfTY dLGtsDLVPF 

r. pubescens HGARFPTSGA AkRIQTAVAK LKAAsnyTDP iLAFVtNyTY sLGqDsLVeL 

A. pediades HGARFPTSGA GTRIQAAVkK LQSAktyTDP RLDFLtNyTY tLGhDDLVPF 

P. lycii HGARWPTSGA rSRqvAAVAK IQmArpfTDP KYEFLnDfvY kFGvADLLPF 

Basldio HGARFPTSGA ATRIQAAVAK LQSA TDP KLDFL-N-TY -LG-DDLVPF 



P . involutus 

P. involutus 

r. pubescens 

A. pediades 

P. lycii 



101 

(phyAl) GAaQSfDAGQ 
{phyA2) GAaQSfDAGl 
GAtQSSEAGQ 
GAIQSSQAGE 
GAnQShQTGt 



EAFARYSkLV 
EvFARYSkLV 
EAFTRYSSLV 
ETFqRYSfLV 
DmYTRYStLf 



SkNNLPFIRA 
SsDNLPFIRS 
SaDELPFVRA 
SkENLPFVRA 
egGDVPFVRA 



dGSDRWDSA 
dGSDRWDTA 
SGSDRWATA 
SSSNRWDSA 
AGdQRWDSS 



150 

TNWTAGFAs A 
TNWTAGF A s A 
nNWTAGFAlA 
TNWTEGFSaA 
TNWTAGFGdA 



Basldio 



GA-QSSQAGQ BAFTRYS-LV S-DNLPFVRA SGSDRWDSA TNWTAGFA-A 



151 200 

P. involutus {phyAD ShNTvqPkLn lilLPQtGNDT LEDNMCPaAG DSDPQvNaWL AVafPSITAR 

P. involutus {phyA2) SrNAiqPkLd LILPQtGNDT LEDNMCPaAG ESDPQvDaWL AsafPSVTAQ 

r. pubescens SsNSitPvLs VIISEaGNDT LDDNMCPaAG DSDPQvNqWL AqFAPPMTAR 

A. pediades ShHvlnPiLf VILSEslNDT LDDaMCPnAG sSDPQtGiWt SIYGTPIAnR 

P. lycii SgETVlPtLq WLqEeGNcT LcNNMCPnEv DGDest . tWL GVFAPnITAR 

Basldio S-NT--P-Ii- VILSB-QNDT LDDKMCP-AG DSDPQ-N-WL AVFAPPITAR 



P. involutus 

P. involutus 

T. puJbescens 

A. pediades 

P. lycii 



201 250 
(phyAl) LNAAAPSvNL TDtDAfNLvs LCAFlTVSkE kkSdFCtLFE giPGsFeAFa 
{phyA2) LNAAAPGANL TDaDAfNLvs LCPFmTVSkE qkSdFCtLFE giPGsFeAFa 
LNAGAPGANL TDtDTyNLlt LCPFETVAtE rrSeFCDIYE elQAE.dAFa 
liNqqAPGANI TAaDvsNLlp LCAFETIvkE tpSpFCNLF . . tPEEFaqFe 
LNAAAPSANL SDsDAltLmd MCPFDTLSsG naSpFCDLF. . tAEEYvSYe 



Basldio 



LMAAAPGANIi TD-DA-NL-- LCPFETVS-B --S-FCDLFE --PBBF-AF- 



251 300 
P. involutus iphyAl) YgGDLDKFYG TGYGQeLGPV QGVGYVNELI ARLTnsAVRD NTQTNRTLDA 
P. involutus {phyA2) YaGDLDKFYG TGYGQALGPV QGVGYINELL ARLTnsAVnD NTQTNRTLDA 
T. pui>escens YnADLDKFYG TGYGQPLGPV QGVGYINELI ARLTaQnVsD HTQTNsTLDS 

A. pediades YfGDLDKFYG TGYGQPLGPV QGVGYINELL ARLTemPVRD NTQTNRTLDS 
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p. lycii 
Basidlo 



YyyDLDKYYG TGpGNALGPV QGVGYVNELL ARLTgQAVRD ETQTNRTLDS 
Y-GDLDKPYG TGYGQPLGPV QGVGYINELL ARLT-QAVRD MTQTKRTLDS 



P. involutus 

P. involutus 

T. pubescens 

A. pediades 

P. lycil 



301 

(phyAl) SPvTFPLNKT FYADFSHDNl MVAVFSAMGL FrQPAPLsTS 
{phyA2) APdTFPLNKT MYADFSHDNl MVAVFSAMGL FrQSAPLsTS 
SPeTFPLNRT LYADFSHDNQ MVAIFSAMGL FNQSAPLDPT 
SPlTFPLDRS lYADLSHDNQ MIAIFSAMGL FNQSSPLDPS 
dPaTFPLNRT FYADFSHDNt MVPIFAALGL FNaTA.LDPl 



350 

vPNPwRTWrT 
tPDPNRTWLT 
tPDPaRTFLv 
f PNPKRTWVT 
kPDeNRlWVd 



Basidlo 



SP-TPPLNRT PYADFSHDNQ MVAIFSAMGL PNQSAPLDPS -PDPNRTWVT 



P. 
P. 

r. 

A. 

p. 



involutus 

involutus 

puhescBns 

pediades 

lycii 



351 

iphyAl) SsLVPFSGRM WERLsC. 
{phyA2) SsWPFSARM aVERLsC. 

kKIVPFSARM WERLdC. 



400 



.f GT tkV RVLVQDqVQP 

-a GT tkV RVLVQDqVQP 

.g GA qsV RLLVNDAVQP 

SRLtPFSARM VtERLlCqrd GTgsggpsri mrngnvqtfv RILVNDALQP 
SKLVPFSGHM tVEKLaC sgkeaV RVLVNDAVQP 



Basidlo 



SKLVPFSARH WERL-C- 



GT- 



-V RVLVNDAVQP 
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P. involutus (phyAl) 
p. involutus {phyA2) 
T. pubescens 
A, pediades 
P. lycii 

Basidio 
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401 

LEPCGGDrNG ICTLAkFVES 
LEFCGGDqDG ICALDkFVES 
LAFCGADtsG vCTLDAFVES 
LKFCGGDmDS ICTLEAFVES 
LEFCGG.vDG vCeLsAFVES 

LBFC66D-Da -CTLDAPVBS 



441 

QtFARsDGaG DFEKCPATSa - 
QaYARsGGaG DFEKCLATTv - 
OaYARNDGEG DFEKCFAT — - 

OkYAREDGQG DFEKCFD - 

QtYARENGOG DPAKCgfvPs e 

Q-YARSDGQG DPEKCPATP- - 
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Figure 4 



A. terreus 9al KhsdCNSVDh GYQCfPELSH kWGlYAPYFS LqDESPFPlD VPeDCHITFV 

A. terreus cbs NhsdCtSVDr GYQCfPELSH kWGlYAPYFS LqDESPFPlD VPdDCHITFV 
A. jaiger var. awamori NqsTCDTVDq GYQCfSEtSH LWGQYAPFFS LANESAISPD VPaGCRVTPa 

A. nigGr NRRL3135 NqsSCDTVDq GYQCfSEtSH LWGQYAPFFS LANESvISPE VPaGCRVTFa 

A. fumigatus 13073 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDEISVSSK LPkDCRITLV 

A. fumigatus 32722 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDEISVSSK LPkDCRITLV 

A. fumigatus 58128 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDEISVSSK LPkDCRITLV 

A. fumigatus 26906 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDEISVSSK LPkDCRITLV 

A. fumigatus 32239 GSkACDTVEl GYQCsPGtSH LWGQYSPFFS LEDElSVSSD LPkDCRVTFV 

E. nidulans QNHSCNTaDG GYQCfPNVSH VWGQYSPYFS lEQESAISeD VPhGCeVTFV 

T. thermopiiiJus DSHSCNTVEG GYQCrPEISH sWGQYSPFFS LADQSEISPD VPqNCKITFV 

r. lanuginosa nvDIAR hWGQYSPFFS LAEvSEISPA VPkGCRVeFV 

M, thermophila ESRPCDTpDl GFQCgTAISH FWGQYSPYFS VPsElDaS. . IPdDCeVTFa 

Basidio xSxPxrxtAA qLPipxQxqx xWSPYSPYFP VAxyxA. . . . pPaGCQIxqV 

Consensus NSHSCDTVDG GYQC-PEISH LWGQYSPFFS LADESAISPD VP-GCRVTFV 

PcplO NSHSCDTVDO GYQCPPBISH LWGQYSPFFS LADESAISPD VPKGCRVTFV 



A. terreus 9al QVLARHGARs PThSKTKaYA AtlaAIQKSA TaPpGKYAFL QSYNYSLDSE 

A. terreus cbs QVLARHGARs PTdSKTKaYA AtlaAIQKNA TaLpGKYAFL KSYNYSMGSE 
A. niger var. awamori QVLSRHGARY PTeSKGKKYS ALIeEIQQNv TtFDGKYAPL KTYNYSLGAD 

A. niger NRRL3135 QVLSRHGARY PTdSKGKKYS ALIeEIQQNA TtFDGKYAFL KTYNYSLGAD 

A. fumigatus 13073 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32722 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 58128 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 26906 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32239 QVLSRHGARY PTASKSKKYk kLVtAIQKNA TeFKGKFAFL ETYNYTLGAD 

E, nidulans QVLSRHGARY PTeSKSKaYS GLIeAIQKNA TsFwGQYAFL ESYNYTLGAD 

T, thermophilus QLLSRHGARY PTSSKTElYS qLIsrIQKtA TaYKGyYAFL KdYrYqLGAN 

r. lanuginosa QVLSRHGARY PTAhKSEvYA ELLqrIQDtA TeFKGDFAFL RdYaYhLGAD 

M, thermophila QVLSRHGARa PTlkRAasYv DLIdrlHhGA IsYgPgYEFL RTYDYTLGAD 

Basidio NIIqRHGARF PTSGaAtRiq AaVakLQsax xxtDPKLDFL xnxtYxLGxD 

Consensus QVLSRHGARY PTSSKSKKYS ALI-AIQKNA T-FKGKYAFL KTYNYTLGAD 

FcplO QVLSRHGARY PTSSKSKKYS ALIBAIQKNA TAPKGKYAFL KTYNYTLGAD 



lOX 

A. terreus 9al ELTPFGrNQL rDlGaQFYeR YNAL.TRhIn 

A. terreus cbs NLTPFGrNQL qDlGaQFYRR YDTL.TRhIn 
A. niger var. awamori DLTPFGEQEL VNSGIKFYQR YESL.TRnll 

A. niger NRRL3135 DLTPFGEQEL VNSGIKFYQR YESL.TRnlV 

A. fumigatus 13073 DLTPFGEQQL VNSGIKFYQR YKAL,ARsW 

A. fumigatus 32722 DLTPFGEQQL VNSGIKFYQR YKAL.ARsW 

A. fumigatus 58128 DLTPFGEQQL VNSGIKFYQR YKAL.ARsW 

A. fumigatus 26906 DLTAFGEQQL VNSGIKFYQR YKAL.ARsW 

A. fumigatus 32239 DLTPFGEQQM VNSGIKFYQK YKAL.AgsW 

E, nidulans DLTiFGENQM VDSGaKPYRR YKnL.ARknt 

T. thermophilus DLTPFGENQM IQlGIKFYnH YKSL.ARnaV 

T. lanuginosa NLTRFGEEQM MESGrQFYHR YREq.ARelV 

M, thermophila ELTRtGQQQM VNSGIKFYRR YRAL.ARksI 

Basidio DLvPFGAxQs sQAGqEaFtR YsxLvSxdnL 



PFVRATDAsR 
PFVRAADSsR 
PFIRSSGSsR 
PFIRSSGSsR 
PFIRASGSDR 
PFIRASGSDR 
PFIRASGSDR 
PFIRASGSDR 
PFIRSSGSDR 
PFIRASGSDR 
PFVRCSGSDR 
PFVRAAGSAR 
PFVRTAGqDR 
PFVRASGSDR 



150 

VhESAEKFVE 
VhESAEKFVE 
VIASGEKFIE 
VIASGKKFIE 
VIASGEKFIE 
VIASGEKFIE 
VIASGEKFIE 
VIASGEKFIE 
VIASGEKFIE 
WASAEKFIN 
VIASGrlFIE 
VIASAEfFnr 
WhSAENFtQ 
WDSAtNWtA 



Consensus DLTPFGEQQM VNSGIKFYRR YKAL-AR-IV PFVRASGSDR VIASAEKFIE 
FcplO DLTPFGEQQM VNSGIKFYRR YKAL.ARKTV PFVRASGSDR VTASAEKFIE 
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151 200 

A. terreus 9al GFQTARqDDh hAnphQPSPr VDValPEGsA YNNTLEHSLC TAFEs. . .St 

A. terreus cbs GFQNARqGDP hAnphQPSPr VDWIPEGtA YNNTLEHSIC TAFEa...St 

A. niger var. awamori GFQSTKLkDP rAqpgQSSPk iDWiSEAsS sNNTLDpGtC TvFEd...SE 

A. niger NRRL3 135 GFQSTKLkDP rAqpgQSSPk IDWISEAsS sNNTLDpGtC TvFEd, . .SE 

A. fumigsttus 13073 GFQqAKLADP gAt . nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa...SQ 

A, flunigatus 32722 GFQqAKLADP gAt. nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa...SQ 

A. fumigatus 58128 GFQqAKLADP gAt. nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa...SO 

A, fumigatua 26906 GFQqAKLADP gAt. nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa...SQ 

A. fumigatus 32239 GFQqANVADP gAt.nRAAPV ISVIIPESeT YNNTLDHSVC TnFEa...SE 

E. nidulans GFRkAQLhDh g.s.gQATPV VNVIIPEidG FNNTLDHStC vSFEn..,dE 

r. thermophilus GFQSAKVIDP hSdkhDAPPt INVIIeEGpS YNNTLDtGsC PvFEd. . .Ss 

r. lanuginosa GFQdAKdrDP rSnkdQAePV INVIISEEtG sNNTLDgltC PAaEe.-.Ap 

W. thermophila GFHSAILADR gStvrPTlPy dmWIPETaG aNNTLHNDLC TAFEegPySt 

Basidio GFaxA sxntxxPx LxVILSExg. .NDTLDDNMC PxAG 

Consensus GFQSAKLADP -A QASPV INVIIPEG-G YNNTLDHGLC TAPE--P-SE 

FcplO GFQSAKLADP GAKPHQASPV XNVXIPBQAG YNNTLDHGLC TAFBB...SE 



A. terreus 9a 1 

A. terreus cbs 

A. niger var. awamori 

A. niger NRRL3135 

A. fumigatus 13073 

A. fumigatus 32722 

A. fumigatus 58128 

A. fuinigatus 26906 

A. fumigatus 32239 

E. nidulans 

T. therinophilus 

T, lanuginosa 

M. thermophila 

Basidio 



201 

VGDDavANFT 
VGDAaADNFT 
LADtVEANFT 
LADtVEANFT 
LGDEVAANFT 
LGDEVAANFT 
LGDEVAANFT 
LGDEVAANFT 
LGDEVEANPT 
rADEIEANFT 
gGHDaQEKFA 
.DptqpAEFl 
IGDDaQDtYl 
dSDpqxnxWl 



AVFAPAIaqR 
AVFAPAIakR 
AtFAPSIRqR 
AtFvPSIRqR 
ALFAPdIRAR 
ALFAPdIRAR 
ALFAPdIRAR 
ALFAPdIRAR 
ALFAPAIRAR 
AIMGPPIRkR 
kqFAPAIlEK 
qVFGPRVlkK 
StFAGPItAR 
AVFAPPItAR 



LEAdLPOVQL 
LEAdLPGVQL 
LEndLSGVtL 
LEndLSGVtL 
aEkhLPGVtL 
aEkhLPGVtL 
aEkhLPGVtL 
aKkhLPGVtL 
lEkhLPGVQL 
LEndLPGIKL 
IKDhLPGVDL 
ItkhMPGVNL 
VNAnLPGaNL 
LNAaaPGaNL 



StDDWNLMA 
SADDWNLMA 
TDtEVtyLMD 
TDtEVtyLMD 
TDEDWSIiMD 
TDEDWSLMD 
TDEDWSLMD 
TDEDWSLMD 
TDDDWSLMD 
TNENVIyLMD 
AveDVpyLMD 
TlEDVplFMD 
TDADtVaLMD 
TDxDaxNLxx 



250 

MCPFETVSIT 
MCPFETVSIT 
MCSFDTIStS 
MCSFDTIStS 
MCSFDTVArT 
MCSFDTVArT 
MCSFDTVArT 
MCSFDTVArT 
MCSFDTVArT 
MCSFDTMArT 
LCPFETLArn 
LCPFDTVGsd 
LCPFETVAsS 
LCPFETVS . . 



Consensus LGDDVEANFT AVFAPPIRAR LEA-LPGVNL TDEDWNLMD MCPFDTVA-T 
FcplO LGDDVEANFT AVFAPPIRAR LEAKLPGVNL TDEDWNUID HCPFDTVART 







251 








300 


A. 


terreus 9a 1 


dD. 


.Aht . . . 


LSPF 


CDLFTa . 


.tE WtQYNYLlSL dKYYGYGGGN 


A. 


terreus cbs 


dD. 


.Aht . . . 


LSPF 


CDLFTa . 


.aE WtQYNYLlSL dKYYGYGGGN 


A. 


niger var. awamori 


TV. 


-DTK. . . 


LSPF 


CDLFTH. 


.dE WiHYDYLQSL kKYYGHGAGN 


A. 


niger NRRL3135 


Tv. 


.DTK. . . 


LSPF 


CDLFTH..dE WiNYDYLQSL kKYYGHGAGN 


A. 


fumigatus 13073 


SD. 


.ASQ. . . 


LSPF 


CQLFTH. 


.nE WkKYNYLQSL gKYYGYGAGN 


A. 


fumigatus 32722 


SD. 


.ASQ. . . 


LSPF 


CQLFTH. 


.nE WkKYNYLQSL gKYYGYGAGN 


A. 


fumigatus 58128 


SD. 


.ASQ. . . 


LSPF 


CQLFTH.. nE WkKYNYLQSL gKYYGYGAGN 


A. 


fumigatus 26906 


SD. 


.ASQ. . . 


LSPF 


CQLFTH. 


.nE WkKYNYLQSL gKYYGYGAGN 


A. 


fumigatus 32239 


AD. 


.ASE. . . 


LSPF 


CAIFTH. 


,nE WkKYDYLQSL gKYYGYGAGN 


E. 


niduiajis 


AH. 




LSPF 


CAIFTE. 


.kE WlQYDYLQSL sKYYGYGAGS 


T. 


thermoph ilus 


ht. 


.DT 


LSPF 


CALsTQ. 


• eE WqaYDYYQSL gKYYGnGGGN 


T. 


lanuginosa 






CHLFTa, 


.dD WmaYDYYyTL dKYYSHGGGS 


M. 


thermophila 


SsdpATadag ggngrpLSPF 


CrLFSE . 


-sE WraYDYLQSV gKWYGYGPGN 


Basidio 








CDLFexxpeE FxaFxYxgdL dKFYGtGyGQ 




Consensus 


SD- 


-ATQ 


LSPF 


CDLFTH - 


--E W-QYDYLQSL -KYYGYGAGN 




FcplO 






CDLFTH.. DB HIQYDYLQSL GRYYGTGAGN 
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A. terreuB 9a 1 

A. terreus cbs 

A. niger var. awamori 

A. niger NRRL3135 

A. fimigatus 13073 

A. fimigatus 32722 

A. fuiaigatus 58128 

A. fimigatus 26906 

A. fimigatus 32239 

E. nidulans 

T. theinnophilus 

r. lanuginosa 

M. ciierjnophila 

Basidio 



301 

PliGPvQGVGW 
PLGPvQGVGW 
PLGPTQGVGY 
PLGPTQGVGY 
PLGPAQGIGF 
PliGPAQGIGF 
P3JGPAQGIGF 
PLGPAQGIGF 
PliGPAQGIGF 
PLGPAQGIGF 
PLGPAQGVGF 
AFGPSRGVGF 
PLGPTQGVGF 
PLGPvQGVGY 



aNELMARLTR 
aNELIARLTR 
aNELIARLTH 
aNELIARLTH 
tNELIARLTR 
tNELIARLTR 
tNELIARLTR 
tNELIARLTR 
tNELIARLTN 
tNELIARLTQ 
vNELIARMTH 
vNELIARMTg 
vNELLARLA. 
iNELLARLTx 



A.PVHDHTCv 
S . PVHDHTCv 
S . PVHDDTSS 
S . PVHDDTSS 
S . PVQDHTST 
S . PVQDHTST 
S . PVQDHTST 
S . PVQDHTST 
S . PVQDHTST 
S . PVQDNTST 
S . PVQDYTTv 
NlPVKDHTTv 
GvPVRDgTST 
qa . VRDNTqT 



NNTLDASPAT 
NNTLDANPAT 
NHTLDSNPAT 
NHTLDSSPAT 
NsTLvSNPAT 
NsTLvSNPAT 
KsTLvSNPAT 
NsTLvSNPAT 
NsTLDSDPAT 
NHTLDSNPAT 
NHTLDSNPAT 
NHTLDdNPET 
NRTLDGDPrT 
NRTLDSSPxT 



350 

PPLNATLYAD 
FPLNATLYAD 
FPLNSTLYAD 
FPLNSTLYAD 
FPLNATMYvD 
FPLNATMYvD 
FPLNATMYvD 
FPLNATMYvD 
FPLNATIYvD 
FPLDrkLYAD 
FPLNATLYAD 
FPLDAvLYAD 
FPLGrPLYAD 
FPLNrTFYAD 



Consensus PLGPAQGVGF -NELIARLTH S- PVQDHTST NHTLDSNPAT FPLNATLYAD 
FcplO PLGPAQGVGF VNBLIARLTH S. PVQDHTST NHTLDSNPAT FPLNATLYAD 



A. terreus 9a 1 

A. terreus cbs 

A. niger var. awamori 

A. niger NRRL3135 

A. fimigatus 13073 

A. fimigatus 32722 

A. fimigatus 58128 

A. fimigatus 26906 

A. fimigatus 32239 

E. nidulans 

T, thermophilus 

T. lanuginosa 

M, thermophila 

Basidio 



351 

FSHDSnLVSI 
FSHDSnLVSI 
FSHDNGIISI 
FSHDNGIISI 
FSHDNSMVSI 
FSHDNSMVSI 
FSHDNSMVSI 
FSHDNSMVSI 
FSHDNGMIPI 
FSHDNSMISI 
FSHDNTMtSI 
FSHDNTMtGI 
FSHDNdMMGV 
FSHDNqMVAI 



FWALGLYNGT 
FWALGLYNGT 
LFALGLYNGT 
LFALGLYNGT 
FFALGLYNGT 
FFALGLYNGT 
FFALGLYNGT 
FFALGLYNGT 
FFAMGLYNGT 
FFAMGLYNGT 
FaALGLYNGT 
FsAMGLYNGT 
LgALGaYDGv 
FsAMGLFNqS 



aPLSqTSVE. 
kPLSqTTVE . 
kPLSTTTVE . 
kPLSTTTVE . 
ePLSrTSVE . 
gPLSrTSVE . 
ePLSrTSVE . 
ePLSrTSVE . 
ePLSqTSeE . 
qPLSmdSVE . 
akLSTTelK. 
kPLSTSklQP 
pPLdkTA. .R 
aPLdPSxpDP 



. SvsQTDGYA 
.ditrTDGYA 
.NitQTDGFS 
.NitQTDGFS 
.SaKElDGYS 
. SaKElDGYS 
. SaKElDGYS 
. SaKElDGYS 
.StKESNGYS 
.SiQEmDGYA 
.SiEETDGYS 
pTgAAADGYA 
rdpEElGGYA 
nrt Wv 



400 

AAWTVPFAAR 
AAWTVPFAAR 
SAWTVPFASR 
SAWTVPFASR 
ASWvVPFGAR 
ASWvVPFGAR 
ASWvVPFGAR 
ASWvVPFGAR 
ASWAVPFGAR 
ASWTVPFGAR 
AAWTVPFGGR 
ASWTVPFAAR 
ASWAVPFATUl 
TSklVPFSAR 



Consensus FSHDNTMVSI FFALGLYNGT -PLSTTSVEP -S-EETDGYA ASWTVPFAAR 
FcplO FSHDNTMVSI FFALGLYNGT KPLSTTSVB. .SIBETDGYA ASWTVPFAAR 



A. terrevs 9al 

A. terreus cbs 

A. niger var. awamori 

A, niger NRRL313 5 

A. fimigatus 13073 

A. fimigatus 32722 

A. fujnigatus 58128 

A. fimigatus 26906 

A. fimigatus 32239 

E, nidulans 

T . thermophi 1 u s 

T. lanuginosa 

M. thermophila 

Basidio 



401 450 

AYVEMMQC. . ra EKEPL VRVLVNDRVM PLHGCPtDKL 

AYIEMMQC . . ra EKQPL VRVLVNDRVM PLHGCAVDNL 

lYVEMMQC. . Qa EQEPL VRVLVNDRW PLHGCPIDaL 

lYVEMMQC. Qa EQEPL VRVLVNDRW PLHGCPVDaL 

AYfEtMQC. Ks EKEPL VRaLINDRW PLHGCDVDKL 

AYf EtMQC. . Ks EKEPL VRaLINDRW PLHGCDVDKL 

AYfEtMQC. Ks EKESL VRaLINDRW PLHGCDVDKL 

AYfEtMQC. Ks EKEPL VRaLINDRW PLHGCDVDKL 

AYfEtMQC. Ks EKEPL VRaLINDRW PLHGCAVDKL 

AYf ELMQC . E KKEPL VRVLVNDRW PLHGCAVDKF 

AYIEMMQC. Dd sDEPV VRVLVNDRW PLHGCEVDsL 

AYVELLRC . Etetsseeee EG. . .EDEPF VRVLVNDRW PLHGCrVDRW 

iYVEkMRC . . sggggggggg EGrqeKDEeM VRVLVNDRVM TLkGCGaDEr 

mvVErLxCxx xgtxxxxxxx xxxxxxxxxx VRVLVNDaVq PLEfCGgDxd 



Consensus AYVEMMQC-- E EG EKEPL VRVLVNDRW PLHGCGVDKL 

FcplO AYVEKHQC. . KA EKEPL VRVLVNDRW PLHGCGVDKL 
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451 482 



A. 


terreus 9a 1 


GRCKrDAFVA 


GLSFAQAG. . 


GNWADCF 




A, 


terreus cbs 


GRCKrDDFVE 


GLSFARAG. . 


GNWAECF 




A. 


niger var. awamori 


GRCtrDsFVr 


GLSFARSG. . 


GDWAECsA — 


— 


A, 


niger NRRIj3135 


GRCtrDsFVr 


GLSFARSG . . 


GDWAECFA — 




A, 


fxxmigatus 13073 


GRCKINDFVK 


GLSWARSG. . 


GNWGECFS-- 




A. 


fxmigatua 32722 


GRCKlNDFVK 


GLSWARSG . . 


GNWGECFS — 




A. 


fimigatus 58128 


GRCKINDFVK 


GLSWARSG. . 


GNWGECFS-- 




A, 


fxmigatus 26906 


GRCKINDFVK 


GLSWARSG. . 


GNWGECFS — 




A. 


fvmigatUB 32239 


GRCKIKDFVK 


GLSWARSG. . 


GNSEQSFS-^ 




E. 


nidulans 


GRCtlDDWVE 


GLNFARSG. . 


GNWKtCFTl- 




T, 


thermophiluB 


GRCKrDDFVr 


GLSFARqG. . 


GNWEGCYAas 


e- 


T. 


lanuginosa 


GRCRrDEWIK 


GLTFARqG . . 


GHWDrCF 




M. 


thermophila 


GmCtlErFIE 


SMAFARGN. , 


GKWD1CFA-- 




Basidio 


GxCtlDAFVE 


SqxYAReDgq GDFEKCFAtp 


XX 



Consensus 
FcplO 



GRCK-DDFVE GLSFARSG-- GNWEECFA-- 
GRCRRDDFVE GLSFARSG. . GNHSBCFA. , 
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Figure 5 

CP-l 

ECO RIMGVFVVLLSIATLFGST17 
TATATGAATTCAIfiGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

1 + + + ^ + + 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTAI,GPRGNSHSCDTVDGG37 
CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

61 + + + + + + 120 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

YQCFPEISHLWGQYSPZFSLS? 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATTCTTCTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTAAGAAGAGAA 

aDESAISPDVPKaCRVTFVQ77 
TGGCTGACGAATCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTC 

X81 + + + + + + 240 

ACCGACTGCTTAGACGATAAAGAGGTCTGCAAGGTTTCCCGACATCTCAATGAAAGCAAG 
CP-4,1Q 

CP-5.10 

VLSRHGARYPTSSKSKKYSA97 
AAGTTTTGTCTAGACACGQTGCTAGATACCCAACTTCTTCTAAGTCTAAGAAGTACTCTG 

241 + + + + + + 300 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAA6AAGATTCAGATTCTTCATGAGAC 

LIEAIQKNATAFKGKYAFLK 117 
CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP- 6 

CP-7-10 

TYNYTLGADDIiTPFGEQQMV 137 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGG 

361 + + + + + -I- 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACC 

NSGIKFYRRYKALARKIVPF 157 
TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 + + + + + + 480 

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-B.IO 

CP-9-10 

YRASGSDRVIASAEKFIEGF 177 
TCGTTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 

481 + + + + + + 540 

AGCAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGAUPHQASPVIN 197 
TCCAATCTGCTAAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTA 

541 + + + + + + 600 

AGGTTAGACGATTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGA6GTCAATAAT 
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CP-IO.IQ 

CP-11>10 

VIIPEGAGYNNTLDHGLCTA 217 
ACGTTATTATTCCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTO 
eol + + + + + + 

TOCAATAATAAGGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGAC 

FEESELGDDVEANFTAyPAP 237 
CTTTCGAAGAATCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTOTTTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTTAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAG 

CP-12 . 10 

£IRARLEAliLPGVHIiTDEDV 257 
CACCTATTAGAGCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACG 

721 + + + + + + 780 

GTGGATAATCTCGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTOACTGCTTCTGC 

CP-13-1P 

VNLMDMCPFDTVARTSDATQ 277 
TTGTTAACTTGATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTC 

781 + + + + + + 840 

AACAATTGAACTACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAG 

LSPFCELFTHDEWIQYDYLQ 297 
AATTGTCTCCATTCTGT6ACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGC 

841 + + + + + + 900 

TTAACAGAGGTAAGACACTGAACAAGTGAGTGCTGCTTACCTAAGrTATGCTGATGAACG 

CP-14.1Q 

CP-15,1Q 

SLGKYYGYGAGNPLGPAQGV 317 
AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGOTCCAGCTCAAGGTG 

901 + + + + + + 960 

TTAOAAACCCATTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 

GFiJNELIARLTHSPVQDHTS 337 
TTGGTTTCGTTAACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTT 

961 + + + + + + 1020 

AACCAAAGCAATTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAA 
CP-16.1Q 

CP-17,1Q 

TNHTLDSNPATF PLNATLYA 357 
CTACTAACCACACTTTGGACTCTAACCCAOCTACTTTCCCATTGAACGCTACTTTGTACG 

102X + + + + + + X080 

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNlMySIFFALGLYNG 377 
CTGACTTCTCTCACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

1081 + + + + + + 1140 

GACTGAAGAGAGTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-19.1Q 

CPt19.10 

TliPLSTTSVESIEETDGYAA 397 
GTACTAAGCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACGCTG 

1141 + + -- + + + + 1200 

CATGATTCGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGCGAC 
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SWTVPFAARAYVEMMQCEAE 417 
CTTCTTGOACTGTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTG 

1201 + + + + - + + 1260 

GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACACTTCGAC 

CP-20.10 

CP-21.1Q 

KEPLVRVIiVNDRVVPLHGCS 437 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 

1261 + --- + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 

VDKLGRCKRDDFVEGLSFAR 457 
GTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTOAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 1380 

CACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

10 

SGGNWfiECFA* Eco RI 467 
GATCTGGTGGTAACTGGGAAGAATGTTTCGCT TAAGAATTCATATA 

1381 + + + - + 1426 

CTAGACCACCATTGACCCTTCTTACAAAGCGAATTCTTAAGTATAT 
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Figure g 
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1 

involutus {phyAD FPipeseqR 

involutus {phyA2) FsipeseqR 

puhescens - LDvtRDVqQ 

pediades -pffpPQIqD 

^ LPipAQnTs 

terreus 9al KhsdCNSVDh GYQCfPELSH 

terreus cbs NhsdCtSVDr GYQCfPELSH 

niger var. awamori NqsTCDTVDq GYQCfSEtSH 

niger T213 NqsSCDTVDq GYQCfSEtSH 

niger NRRL3135 NqsSCDTVDq GYQCfSEtSH 

fumigatus ATCC13073 GSkSCDTVDl GYQCsPAtSH 

fumigatus ATCC32722 GSkSCDTVDl GYQCsPAtSH 

fumigatus ATCC58128 GSkSCDTVDl GYQCsPAtSH 

fumigatus ATCC26906 GSkSCDTVDl GYQCsPAtSH 

fumigatus ATCC32239 GSkACDTVEl GYQCsPGtSH 

nidalans QNHSCNTaDg GYQCfPNVSH 

thermophiluB DSHSCNTVEg GYQCrPEISH 

lanuginosa - nvDIAR 

thermophila ESRPCDTpDl GFQCgTAISH 
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KSHSCDTVD- GYQC-PEISH LWGQYSPFFS LADESAISPD VPKGCRVTFV 



P. involutus {phyAD 

P. involutus {phyA2) 

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9a 1 

A. terreus cbs 

A. niger var. awamori 

A. jiiger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T. thermophilus 

T. lanuginosa 

M. thermophila 
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NIIqRHGARF 
NIIqRHGARF 
HIIqRHGARF 
NIIqRHGARF 
NLIqRHGARW 
QVLARHGARs 
QVLARHGARs 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QLLSRHGARY 
QVLSRHGARY 
QVLSRHGARa 
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AgLtKLQgvq 

AgLsKLQsvq 

TaVAKLKaaS 

AaVKKLQsak 

AaVAKIQmar 

AtlAalQKSA 
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kLVtalQaNA 

kLVtalQaNA 

kLVtalQaNA 
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nftDAKFnFI 
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nytDPlLAFV 

TytDPRLDFL 
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TaFpGKYAFL 

TaLpGKYAFL 

TtFDGKYAFL 
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KSFKYdLGns 

KSFtYdLGTs 

tnYtYSLGqD 

tnYtYTLGhD 

NdFvYkFGvA 

QSYNYSLDSE 

KSYNYSMGSE 

KTYNYSLGAD 

KTYNYSLGAD 

KTYNYSLGAD 

KTYNYTLGAD 

KTYNYTLGAD 

KTYNYTLGAD 

KTYNYTLGAD 

ETYNYTLGAD 

ESYNYTLGAD 

KdYrYqLGAN 

RdYaYhLGAD 

RTYDYTLGAD 



QVLSRHGARY PTSSKSKKYS ALIBRIQKNA T-FKGKYAFL KTYNYTLGAD 
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p. involutus {phyAD 

P. involutus iphyA2) 

T, pubescens 

A. pediades 

P. lycii 

A, terreus 9al 

A. terreu3 cbs 

A, niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumlgatus ATCC13073 

A. fumigatus ATCC32722 

A. fimigatus ATCC5ei28 

A. fumigatus ATCC2e906 

A. fumigatus ATCC32239 

E, nidulans 

T. thermophilus 

T. lanuginosa 

M. thermophila 
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P. involutus iphyAl) 

P. involutus {phyA2) 

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9a 1 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T. thermophxlus 

T. lanuginosa 

M, thermophila 
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DLvPFGAaQs 
DLvPFGAaQs 
sLveLGAtQs 
DLvPFGAlQs 
DLlPFGANQs 
ELTPFGrNQL 
NLTPFGrNQL 
DLTPFGEQEL 
DLTPFGEQEL 
DLTPFGEQEL 
DLTPFGEQQL 
DLTPFGEQQL 
DLTPFGECX5L 
DLTAFGEQQL 
DLTPFGEQQM 
DLTiFGENQM 
DLTPFGEMQM 
NLTRFGEEQM 
ELTRtGQQQM 



fDAGqEaFaR 
fDAGLEvFaR 
sEAGqEaFtR 
sQAGeEtPQR 
hQTGtDMYtR 
rDlGaQFYeR 
qDlGaQFYRR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQK 
VDSGaKFYRR 
IQlGIKFYnH 
MESGrQFYHR 
VNSGIKFYRR 



YskLvSKNnL 
YsklivSsDnL 
YsSLvSaDeL 
YsfLvSKEnL 
YsTLfEgGdV 
YNAL.TRHIn 
YDTL.TRHIn 
YESIi.TRNII 
YESL.TRNII 
YESL.TRNIV 
YKAL.ARSW 
YKAL.ARSW 
YKAL.ARSW 
YKAL.ARSW 
YKAL.AgSW 
YKnL . ARKht 
YKSL.ARNaV 
YREq.AREIV 
YRAL.ARKsI 



PFIRAdGSDR 
PFIRSdGSDR 
PFVRASGSDR 
PFVRASSSNR 
PFVRAAGdQR 
PFVRATDAsR 
PFVRAADSSR 
PFIRSSGSsR 
PFIRSSGSsR 
PFIRSSGSsR 
PFIRASGSDR 
PFIRASGSDR 
PFIRASGSDR 
PFIRASGSDR 
PFIRSSGSDR 
PFIRASGSDR 
PFVRCSGSDR 
PFVRAAGSAR 
PFVRTAGqDR 
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WDSAtNWtA 
WDTAtNWtA 
WATANNWtA 
WDSAtNWtE 
WDSStNWtA 
VhESAEKFVE 
VhESAEKFVE 
VIASGEKFIE 
VIASGEKFIE 
VIASGKKFIE 
VIASGEKFIE 
VIASGEKFIE 
VIASGEKFIE 
VIASGEKFIE 
VIASGEKFIE 
WASAEKFIN 
VIASGrlFIE 
VIASAEf Fnr 
WhSAENFtQ 



DLTPFGKNQM VNSGIKFYRR YKAL-ARNTV PFVRASGSDR VIASAKKFIS 
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GFaSA. . 
GFaSA. . 
GFalA. . 
GFSAA. . 
GFgdA. . 
GFQTARqDDh 
GFQNARqGDP 
GFQSTKLkDP 
GFQSTKLkDP 
GFQSTKLkDP 
GFQqAKLADP 
GFQqAKLADP 
GFQqAKLADP 
GFQqAKLADP 
GFQqANVADP 
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GFQSAKVIDP 
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GFHSAILADR 



. . shNtvqPk 
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. . shHvlNPI 
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hAnpHQPSPr 
hAnpHQPSPr 
rAqpgQSSPk 
rAqpgQSSPk 
rAqpgQSSPk 
gAt .NRAAPa 
g At . NRAAPa 
gAt - NRAAPa 
gAt -NRAAPa 
gAt .NRAAPV 
g.s.gQATPV 
hSdkHDAPPt 
rSnkDQAePV 
gStvRPTlPy 



LNLILPQ. .T 
LDLILPQ. .T 
LSVIISE. .A 
LfVILSE. .S 
LQWLQE. .E 
VDValPEGSA 
VDWIPEGTA 
IDWISEASS 
IDWISEASS 
IDWISEASS 
ISVIIPESeT 
ISVIIPESeT 
ISVIIPESeT 
ISVIIPESeT 
ISVIIPESeT 
VNVIIPEidG 
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dmWIPETAG 



gNDTLEDNMC 
gNDTLEDNMC 
gNDTLDDNMC 
LNDTLDDAMC 
gNcTLcNNMC 
YNNTLEHSLC 
YNNTLEHSIC 
sNNTLDpGtC 
sNNTLDpGtC 
SNNTLDpGtC 
FNNTLDHGVC 
FNNTLDHGVC 
FNNTLDHGVC 
FNNTLDHGVC 
YNNTLDHSVC 
FNNTLDHStC 
YNNTLDtGsC 
sNNTLDgltC 
aNNTLHNDLC 
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PAaGD 

PAaGE 

PAaGD 

PnaGs 

PnevD 

TAFEs ... ST 
TAFEA. . .ST 
TvFED- . .Se 
TvFED . . . se 
TvFED . , . se 
TkFEA. . .Sq 
TkFEA . . . Sq 
TkFEA. . .Sq 
TkFEA. . .Sq 
TnFEA. . .Se 
vSFEN. . .de 
PvFED . . . SS 
PAaEE . . . AP 
TAFEEgpyST 
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201 250 

P. involutua (phyAl) .SDpqvnaWl AVafPSItAR LNAaaPSVNL TDtDafNLVs LCAPlTVSK. 

P. involutua {phyA2) .SDpqvDaWl AsafPSVtAQ LNAaaPGaNL TDADafNLVs liCPPmTVSK. 

T. pvt>escen3 .SDpqvnQWl AqFAPPMtAR LNAgaPGaNL TDtDtyNLLt LCPFETVAt. 

A. pediades .SDpqtGiWT SIYGTPIanR LNqqaPGaNI TAADVsNLTp LCAFETXvK. 

P. lycil .GDESt.tWl GVFAPnItAR LNAaaPSaNL SDsDaLtLMD MCPFDTLSb. 

A. tGrreus 9al VGDDAvANFT AVPAPAIaqR LEAdLPGVQL StDDWNLMA MCPFETVSIT 

A. terreus cbs VGDAAADNFT AVFAPAIakR LEAdLPGVQL SADDWNLMA MCPFETVSIT 

A. niger var. Awamori LADtvEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 

A. niger T213 LADtvEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 

A. niger NRRL3135 LADtvEANFT AtFvPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 

A. fvmigatus ATCC13073 LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVART 

A. fumigatus ATCC32722 LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVART 

A. fumigatus ATCC58128 LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVART 

A. fumigatus ATCC26906 LGDEvAANFT ALFAPdIRAR aKkhLPGVtL TDEDWSLMD MCSFDTVART 

A. fumigatus ATCC32239 LGDEvEANFT ALFAPAIRAR lEkhLPGVQL TDDDWSLMD MCSFDTVART 

E, nidulans rADEiEANFT AIMGPPIRkR LEndLPGIKL TNENVIyLMD MCSFDTMART 

r. thermophilus gGHDAQEKFA kqFAPAIlEK IKDhLPGVDL AvsDVpyLMD LCPFETLARn 

r. lanuginosa .DptqpAEFl qVFGPRVlkK ItkhMPGVNL TlEDVplFMD LCPFDTVGsd 

W. thermophila IGDDAQDtYl StFAGPItAR VNAnLPGaNL TDADtVaLMD LCPFETVAsS 
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LODDAEANFT AVFAPPIRAR LEA-LPGVNL TDEDWNLHD MCPFDTVART 



251 300 

P. inwlutus {phyAl) ekkSdP CtLFegiPGs FeaFAYggdL dKFYGtGyGQ 

P, involutus {phyA2) eqkSdF CtLFegiPGs FeaFAYagdL dKFYGtGyGO 

T. puJbescens errSeF CDIYeelqAE .daFAYnadL dKFYGtGyGQ 

A. pediades etpSPF CNLF. .TPEE FaQFEYFgdL dKFYGtGyGQ 

P. lycii gnaSPF CDLF. .TAEE YvsYEYYydL dKYYGtGPGN 

A. terreus 9al dD. .Aht LSPF CDLF. .TAtE WtQYNYLlSL dKYYGYGGGN 

A. terreus cbs dD. .Aht LSPF CDLF. .TAAE WtQYNYLlSL dKYYGYGGGN 

A. niger var. awamori Tv. .DTK LSPF CDLP..ThDE WiHYDYLQSL kKYYGHGAGN 

A. niger T213 Tv. .DTK LSPF CDLF. .ThDE WiHYDYLRSL kKYYGHGAGN 

A. niger NRRL313 5 Tv. .DTK LSPF CDLF. .ThDE WiNYDYLQSL kKYYGHGAGN 

A. fumigatus ATCC13073 SD. .ASQ LSPF CQLF..ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC32722 SD. ,ASQ LSPF CQLF..ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC58128 SD..ASQ LSPF CQLF..ThNE WkKYNYLQSL gKYYGYGAGN 

A, fumigatus ATCC2690e SD. .ASQ LSPF CQLF..ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC32239 AD. .ASE LSPF CAIF..ThNE WkKYDYLQSL gKYYGYGAGN 

E. nidulans ah. .GTE LSPF CAIF. .TEKE WIQYDYLQSL sKYYGYGAGS 

T. thermophilus ht..DT LSPF CALs..TqEE WqaYDYYQSL gKYYGnGGGN 

T. lanuginosa PvlfPrQ LSPF CHLF. .TADD WtnaYDYYyTL dKYYSHGGGS 

M, thermophila SsdpATadag ggngrpLSPF CrLF. .SEsE WraYDYLQSV gKWYGYGPGN 
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P. involutus {phyAD 

P. involutus {phyA2) 

T. puhescens 

A. pediades 

P. lycii 

A. terreus 9al 

A. terreus cbs 

A. niger var. awa;nori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fiunigatus ATCC32722 

A. fumigatus ATCC58128 

A. ftmigatus ATCC26906 

A. fimiigatus ATCC32239 

E. nidulans 

T. thermophilus 

T. lanuginosa 

M. thermqpiJiia 
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P. involutus {phyAD 

P. involutus {phyA2) 

T, puhescens 

A, pediades 

P, lycii 

A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A, fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fujnigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T. thermophilus 

T, lanuginosa 

M, thermophila 
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301 350 
eliGPvQGVGY vNELIARLTN S.AVRDNTqT NRTLDASPvT FPLNkTFYAD 
ALGPvQGVGY iNELLARLTN S.AVNDNTqT NRTLDAaPDT FPLNkTMYAD 
PLGPvQGVGY INELIARLTa q.nVsDHTqT NsTLDSSPET FPUJrTIiYAD 
PLGPvQGVGY iNELIARLTE m.PVRDNTqT NRTUDSSPlT FPLDrSIYAD 
ALGPvQGVGY vNELlARLTg q.AVRDETqT NRTLDSDPAT FPLNrTFYAD 
PliGPvQGVGW aNELMARLTR A.PVHDHTCv NNTLDASPAT FPLNATLYAD 
PLGPvQGVGW aNELIARLTR S.PVHDHTCv NNTLDANPAT FPLNATLYAD 
PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSNPAT FPLNSTLYAD 
PIjGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSNPAT FPLNSTLYAD 
PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSSPAT FPLNSTLYAD 
PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTN S.PVQDHTST NsTLDSDPAT FPLNATIYvD 
PLGPAQGIGF tNELIARLTQ S.PVQDNTST NHTLDSNPAT FPLDrkLYAD 
PLGPAQGVGF vNELIARMTH S.PVQDYTTv NHTLDSNPAT FPLNATLYAD 
AFGPSRGVGF vNELIARMTg NlPVKDHTTv NHTLDdNPET FPLDAvLYAD 
PLGPTQGVGF vNELLARLA. GvPVRDgTST NRTLDGDPrT FPLGrPLYAD 

PLGPAQGVGF -NELIARLTH S-PVQDHTST NHTLDSNPAT FPLNATLYAD 

400 

351 

FSHDNIMVAV FsAMGLFrqP aPLSTSvpNP wrt Wr TSSlVPPSGR 

FSHDNIMVAV FsAMGLFrqS aPLSTSTpDP nrt Wl TSSvVPFSAR 

FSHDNqMVAI FsAMGLFNqS aPLdPTTpDP art Fl vkkiVPFSAR 

LSHDNqMIAI FsAMGLFNqS sPLdPSfpNP krt Wv TSRltPFSAR 

FSHDNTMVPI FaALGLFNAT a.LdPlkpDe nrl Wv DSklVPFSGH 

FSHDSnLVSI FWALGLYNGT aPLSqTSVES Vs.-QTDGYA AAVfTVPFAAR 
FSHDSnLVSI FWALGLYNGT KPLSqTTVEd It..rTDGYA AAWTVPFAAR 
FSHDNGIISI LFALGLYNGT KPLSTTTVEN It..QTDGFS SAWTVPFASR 
FSHDNGIISI LFALGLYNGT KPLSTTTVEN It..QTDGFS SAWTVPFASR 
FSHDNGIISI LFALGLYNGT KPLSTTTVEN It..QTDGFS SAWTVPFASR 
FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak..ElDGYS ASWvVPFGAR 
FSHDNSMVSI FFALGLYNGT gPLSrTSVES ak. .ElDGYS ASWvVPFGAR 
FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak. .ElDGYS ASWvVPFGAR 
FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak.. ElDGYS ASWvVPFGAR 
FSHDNGMIPI FFAMGLYNGT EPLSqTSeES tk. .ESNGYS ASWAVPFGAR 
FSHDNSMISI FFAMGLYNGT QPLSmdSVES Iq. .EmDGYA ASWTVPFGAR 
FSHDNTMtSI FaALGLYNGT akLSTTelKS le. .ETDGYS AAWTVPFGGR 
FSHDNTMtGI FsAMGLYNGT KPLSTSklQP ptgaAADGYA ASWTVPFAAR 
FSHDNdMMGV LgALGaYDGv pPLdkTArrd . .peElGGYA ASWAVPFAAR 

FSHDNTMVSI FFALGLYNGT KPLSTTSVBS I---ETDayA ASWTVPFAAR 
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P. 1 n vol ut us {phyAD 

P, involutus {phyA2) 

T. pubescens 

A* pediades 

p. lycii 

A. terreus 9a 1 

A. terreus cbs 

A, nlger var. awanjori 

A. niger T213 

A. Jiiger NRRL3135 

A. fumigatus ATCC13073 

A. fuinlgatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fu/nigatus ATCC32239 

F. nidulans 

T. tiiermopiiilus 

T. lanuginosa 

M. thermophila 
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p. involutus iphyAl) 

p. involutus {phyA2) 

r. pubescens 

A. pediades 

P. lycxi 

A. terreus 9a l 

A. terreus cbs 

A. niger var. awamori 

A. Jiiger T213 

A. niger NRRL313 5 

A. fumisratus ATCC13073 

A. funiiflratus ATCC32722 

A. fujnigatus ATCC58128 

A. fujnigatus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T. tiiermophilus 

T. lanuginosa 

M. thermophila 
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401 «° 

mvVErLsC. fGt Tk VRVLVQDQVq PIiEfCGgDRn 

maVErLsC. AGt Tk VRVLVQDQVq PLEfCGgDQd 

mvVErLDC. GGa Qs VRLLVNDaVq PLafCGaDts 

mvtErLlCQr DGtGsGGpsr imrNgnvQTF VRIIiVNDaLq PLkfCGgDmd 

mtVEkLaC sgKea VRVLVNDaVq PLEfCGg-vd 

AYVEMMQCrA EK. . .EPL VRVLVNDRVM PLHGCPtDKL 

AYlEMMQCrA EK. . -QPL VRVLVNDRVM PLHGCAVDNL 

lYVEMMQCQA EQ...EPL VRVLVNDRW PLHGCPIDaL 

lYVEMMQCQA EQ...EPL VRVLVNDRW PLHGCPIDaL 

lYVEMMQCQA EQ. . .EPL VRVLVNDRW PLHGCPVDaL 

AYfEtMQCKS EK. . .EPL VRaLINDRW PLHGCDVDKL 

AYfEtMQCKS EK...EPL VRaLINDRW PLHGCDVDKL 

AYfEtMQCKS EK. . .ESL VRaLINDRW PLHGCDVDKL 

AYfEtMQCKS EK...EPL VRaLINDRW PLHGCDVDKL 

AYfEtMQCKS EK. . ,EPL VRaLINDRW PLHGCAVDKL 

AYfELMQCE KK. . .EPL VRVLVNDRW PLHGCAVDKF 

AYIEMMQCDD sD. . .EPV VRVLVNDRW PLHGCEVDsL 

AYVELLRCET ETsSeEEeEG . .ED. . .EPF VRVLVNDRW PLHGCrVDRW 
iWEkMRCsG GGgGgGGgEG ..rQekdEeM VRVLVNDRVM TLkGCGaDEr 

AYVEKMQCKA GG-G-OG-BG --SK---EPL VRVLVNDRW PLHQCGVDKL 

451 482 
GlCtLAKFVE SqTFARSDga GDFEKCFAtS a- 
GlCaLDKFVE SqAYARSGga GDFEKCLAtt v- 
GvCtLDAFVE SqAYARNDge GDFEKCFAt- — 

SlCtLEAFVE SqkYAReDgq GDFEKCFD 

GvCELsAFVE SqTYAReNgq GDFAKCgfvp se 

GRCKrDAFVA GLSFAQAG. . GNWADCF 

GRCKrDDFVE GLSFARAG. . GNWAECF — 

GRCtrDsFVr GLSFARSG. . GDWAECsA — — 
GRCtrDsFVr GLSFARSG. . GDWAECFA— — 
GRCtrDsFVr GLSFARSG. . GDWAECFA— 
GRCKLNDFVK GLSWARSG. . GNWGECFS — — 
GRCKLNDFVK GLSW/^SG. . GNWGECFS — 
GRCKLNDFVK GLSWARSG. . GNWGECFS — — 

GRCKLNDFVK GLSWARSG. . GNWGECFS 

GRCKLKDFVK GLSWARSG. . GNSEQSFS— — 
GRCtLDDWVE GLNFARSG- . GNWktCFTl- — 
GRCKrDDFVr GLSFARqG. . GNWEGCYAas e- 

GRCRrDEWIK GLTFARqG. . GHWDrCF 

GmCtLErFIE SMAFARGN. . GKWDICFA — — 

ORCKLDDFVE GLSFARSG-- ONWAECFA-- -- 
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Figure 7 



MGVFVVLLSIATLFGSTSGT 20 
ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

^ + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

ALGPRGNSHSCDTVDGGYQC 40 
GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 

+ + + + + -+ 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

FPEISHLWGXYSPYFSLADE 60 
TTCCCAGAAATTTCTCACTTGTGGGGTACCTACTCTCCATACTTCTCTTTGGCAGACGAA 

121 + + + + 180 

AAGGGTCTTTAAAQAGTGAACACCCCATGGATGAGAGGTATGAAGAGAAACCGTCTGCTT 

SAISPDVPDDCRVTFVQVLS 80 
TCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

187 — + ^ ^ * 

AGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAGTTCAAAACAGA 

RHGARYPTSSaSKAYSAIilE 100 
AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTGCTTTGATTGAA 

241 + . ^ 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGAATGAGACGAAACTAACTT 

AIQKNATAFKGKYAFLKTYN 120 
GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 

301 ^ ^ + + ^ ^ 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

YTIiGADDLTPFGENQMVNSG 140 
TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGGTTAACTCTGGT 

3ei ^ ^ 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACCAATTGAGACCA 

IKFYRRYKAIiARKIVPFIRA 160 
ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 

421 + + ^ + ^ ^ 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

SGSDRVIASAEKFIEGFQSA 180 
TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 

481 ^ + + ^ ^ 540 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 

KLADPGSQPHQASPVIHVII 200 
AAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTAACGTGATCATT 

541 + . . -H -H 600 

TTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAATTGCACTAGTAA 

PEGSGYNNTLDHGTCTAFED 220 
CCAGAAGGATCCGGTTACAACAACACTTTGGACCACGGTACTTGTACTGCTTTCGAAGAC 

601 + + + + + 660 

GGTCTTCCTAGGCCAATGTTGTTGTGAAACCTGGTGCCATGAACATGACGAAAGCTTCTG 
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SELGDDVEANFTALFAPAIR 240 
TCTGTVATTAGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTCCAGCTATTAGA 

ggl + + + + + 720 

AGACTTAATCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAGGTCGATAATCT 

ARLEADLPGVTLTDEDVVYIi 260 
GCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACGTTGTTTACTTG 

721 + + + + + + 780 

CGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGCAACAAATGAAC 

MDMCPFDTVARTSDATELSP 280 
ATGGACATGTGTCCATTCGACACTGTCGCTAGAACTTCTGACGCTACTGAATTGTCTCCA 

781 ^ ^ * ^ * 840 

TACCTGTACACAGGTAAGCTGTGACAGCGATCTTGAAGACTGCGATGACTTAACAGAGGT 

FCALFTHDEWXQYDYLQSIiG 300 
TTCTGTGCTTTGTTCACTCACGACGAATGGATCCAATACGACTACTTGCAAAGCTTGGGT 

. . + + 900 

g42^ 1- -I + ■* * 

AAGACACGAAACAAGTGAGTGCTGCTTACCTAGGTTATGCTGATGAACGTTTCGAACCCA 

KYYGYGAGNPLGPAQGVGFA 320 

T^GTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGCT 

^ . + + + + 960 

901 + + + f -r 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCGA 

NELIARLTHSPVQDHTSTNH 340 
AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 

961 ^ — . ^ 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

TLDSNPATFPLNATIiYADFS 360 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 

^ . + + 1080 

1021 + + + + * xwou 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

HDNIMISIFFALGLYNGTKP 380 
CACGACAACACTATGATATCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACCAAGCCA 

1081 ^ ^ — ^ ^ 

GTGCTGTTGTGATACTATAGATATUUVGAAGCGAAACCCAAACATGTTGCCATGGTTCGGT 

LSTTSVESIEETDGYSASWT 400 
TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 

1141 — - + ^ + + * * ^200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

VPFAARAYVEMMQCQAEKEP 420 
GTTCCATTCGCTGCTAGAGCTTACGTTGTU^TGATGCAATGTCAAGCTGAAAAGGAACCA 

1201 * ^ ^ ^260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACAGTTCGACTTTTCCTTGGT 

LVRVLVNDRVVPLHGCAVDK 440 
TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 

1261 + * * ^ ^ 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 
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IiGRCKRDDFVEGIiSFARSGG 460 
TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

NWAECFA* 467 
AACTGGGCTGAATGTTTCGCTTAA 

1381 + + 1410 

TTGACCCGACTTACAAAGCGAATT 
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Figure 8 

MGVFVVLLSIATLFGSTSGT 20 
ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

X + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

ALGPRGNSHSCDTVDGGYQC 40 
GCCTTGGGTCCTCGTGGTAACTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 

61 + + + + + + 120 

CGGAACCCAGGAGCACCATTGAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

A FPKISHLWGTYSPFFSIiADE 60 

TTCCCAGAAATTTCTCACTTGTGGGGTACATACTCTCCATTCTTCTCTTTGGCTGACGAA 

121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGTATGAGAGGTAAGAAGAGAAACCGACTGCTT 

SAISPDVPKGCRVTFVQVLS 80 
TCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

131 + + + + + + 240 

AGACGATAAAGAGGTCTGCAAGGTTTCCCAACATCTCAATGAAAGCAAGTTCAAAACAGA 

RHGARYPTSSASKj^YSAIilE 100 
AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCGTACTCTGCTTTGATTGAA 

241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGCATGAGACGAAACTAACTT 

AIQKNATAFKGKYAFLKTYN 120 
GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 

301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

A YTLGADDIiTPFGEQQMVNSG 140 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGGTTAACTCTGGT 

361 + + + + - — + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACCAATTGAGACCA 

IKFYRRYKAIiARKIVPFiRA 160 
ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 

421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

SGSDRVIASAEKFIEGFQSA 180 
TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 

481 + + - + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 

KLADPGANPHQASPVINVII 200 
AAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTAACGTTATTATT 

541 + + + + + + 600 

TTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAATTGCAATAATAA 

PEGAGYNKTTLDHGLCTAFEE 220 
CCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTGCTTTCGAAGAA 

601 + + + + + + 660 

GGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGACGAAAGCTTCTT 
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SE LGDDVEANFTAVFAPPIR 240 
TCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTCCACCAATTAGA 

661 + + + + + + 720 

AGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAGGTGGTTAATCT 

ARIiEAHLPGVNLTDEDVVNIi 260 
GCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACGTTGTTAACTTG 

721 + + + + + + 780 

CGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGCAACAATTGAAC 

MDMCPFDTVARTSDATQLSP 280 
ATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTCAATTGTCTCCA 

761 + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAGTTAACAGAGGT 

FCDIiFTHDEWIQYDYIiQSIiQ 300 

TTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGCAATCTTTGGGT 

. 4. + + 900 

841 + + + + + f 

AAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACGTTAGAAACCCA 

KYYGYGAGNPLGPAQGVGFV 320 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGTT 

^ . + + + + 960 

901 + + + ^ 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCAA 

NELIARLTHSPVQDHTSTNH 340 
AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 

961 ^ * ^ ^ + ^020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

TI-DSNPATFPLNATLYADFS 360 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 

. . 4. + 1080 

1021 + + + + * 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

HDNTMVSIFFAL.GIiYNGTKP 380 
CACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACTAAGCCA 

1081 ^ + * ^ ^"^ 

GTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGATTCGGT 

LSTTSVESIEETDGY£ASWT 400 
TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 

1141 ^ + + ^ ^ + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

VPFAAR AYVEMMQCEAEKEP 420 
GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTGAAAAGGAACCA 

1201 ^260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACACTTCGACTTTTCCTTGGT 

LVRVIiVNDRVVPIiHGCGVDK 440 
TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGGTGTTGACAAG 

1261 + ^ ^ ^ "20 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACCACAACTGTTC 
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LGRCKRDDFVEGLSFARSGG 
TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + * 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

NWEECFA* 467 
AACTGGGAAGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCTTCTTACAAAGCGAATT 
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Figurg g 

MGVFVVLLSIATLFGSTSGT 20 
ATGGGGGTTTTCGTCGTTCTATTATCTATCGCGACTCTGTTCGGCAGCACATCGGGCACT 

^ ^ ^ + + + + 60 

TACCCCCAAAAGCAGCAAGATAATAGATAGCGCTGAGACAAGCCGTCGTGTAGCCCGTGA 

ADGPRGNHSKSCDTVDLGYQ 40 
GCGCTGGGCCCCCGTGGAAATCACTCCAAGTCCTGCGATACGGTAGACCTAGGGTACCAG 

61 ^ ^ + * + 120 

CGCGACCCGGGGGCACCTTTAGTGAGGTTCAGGACGCTATGCCATCTGGATCCCATGGTC 

CSPATSHLWGOIYSPXFSLED 60 
TGCTCCCCTGCGACTTCTCATCTATGGGGCACGTACTCGCCATaCTTTTCGCTCGAGGAC 

121 + ^ + + ^ 

ACGAGGGGACGCTGAAGAGTAGATACCCCGtgCATGAGCGGTAtGAAAAGCGAGCTCCTG 

ELSVSSKLPKDCRITLVQVL 80 
GAGCTGTCCGTGTCGAGTAAGCTTCCCAAGGATTGCCGGATCACCTTGGTACAGGTGCTA 

181 + ^ + ^ ^ 240 

CTCGACAGGCACAGCTCATTCGAAGGGTTCCTAACGGCCTAGTGGAACCATGTCCACGAT 

SR HGARYPTSSKSKK YKKLi 100 
TCGCGCCATGGAGCGCGGTACCCAACCAGCTCCAAGAGCAAAAAGTATAAGAAGCTTaTt 

_ ^ ^ ^ + + + 300 

AGCGCGGTACCTCGCGCCATGGGTTGGTCGAGGTTCTCGTTTTTCATATTCTTCGAAtAa 

TAIQANATDFKGKXAFLKTY 120 

ACGGCGATCCAGGCCAATGCCACCGACTTCAAGGGCAAGTacGCCTTTTTGAAGACGTAC 

. + + + 360 

301 + + 

TGCCGCTAGGTCCGGTTACGGTGGCTGAAGTTCCCGTTCAtgCGGAAAAACTTCTGCATG 

NYTLGADDLTPFGEQQLVNS 140 
AACTATACTCTGGGTGCGGATGACCTCACTCCCTTTGGGGAGCAGCAGCTGGTGAACTCG 

361 + + ^ + ^ 420 

TTGATATGAGACCCACGCCTACTGGAGTGAGGGAAACCCCTCGTCGTCGACCACTTGAGC 

GIKFYQRYKALARSVVPFIR 160 
GGCATCAAGTTCTACCAGAGGTACAAGGCTCTGGCGCGCAGTGTGGTGCCGTTTATTCGC 

421 + + + * + "0 

CCGTAGTTCAAGATGGTCTCCATGTTCCGAGACCGCGCGTCACACCACGGCAAATAAGCG 

ASGSDRVIASGEKFIEGFQQ 180 



GCCTCAGGCTCGGACCGGGTTATTGCTTCGGGAGAGAAGTTCATCGAGGGGTTCCAGCAG 

+ + + + + + 

CGGAGTCCGAGCCTGGCCCAATAACGAAGCCCTCTCTTCAAGTAGCTCCCCAAGGTCGTC 



AKIiADPGATNRAAPAlSVII 200 
GCGAAGCTGGCTGATCCTGGCGCGACGAACCGCGCCGCTCCGGCGATTAGTGTGATTATT 

^ ^ + + + 600 

CGCTTCGACCGACTAGGACCGCGCTGCTTGGCGCGGCGAGGCCGCTAATCACACTAATAA 

PESETFNNTIiDHGVCTKFEA 220 
CCGGAGAGCGAGACGTTCAACAATACGCTGGACCACGGTGTGTGCACGAAGTTTGAGGCG 

601 + ^ + ^ * "° 

GGCCTCTCGCTCTGCAAGTTGTTATGCGACCTGGTGCCACACACGTGCTTCAAACTCCGC 



5B08.0 10/22 0999/KaPe 



110 



SQLGDEVAANFTALFAPDIR 240 
AGTCAGCTGGGAGATGAGGTTGCGGCCAATTTCACTGCGCTCTTTGCACCCGACATCCGA 

651 . + ^ "-7-^ — * -^20 

TCAGTCGACCCTCTACTCCAACGCCGGTTAAAGTGACGCGAGAAACGTGGGCTGTAGGCT 

ARIiEKHLPGVTLTDEDVVSL 260 
GCTCGCctCGAGAAGCATCTTCCTGGCGTGACGCTGACAGACGAGGACGTTGTCAGTCTA 

721 --- + + ^ ZJ^- * "^^^ 

CGAGCGgaGCTCTTCGTAGAAGGACCGCACTGCGACTGTCTGCTCCTGCAACAGTCAGAT 

MDMC£FDTVART ._S.D A S Q L S P 280 
ATGGACATGTGTCCGTTTGATACGGTAGCGCGCACCAGCGACGCAAGTCAGCTGTCACCG 

781 + ^ + - — ^ ^ "° 

TACCTGTACACAgGCAAACTATGCCATCGCGCGTGGTCGCTGCGTTCAGTCGACAGTGGC 

FCQLFTHNEWKKYJttYLQSLG 300 
TTCTGTCAACTCTTCACTCACAATGAGTGGAAGA;^ACgACTACCTTCAGTCCTTGGGC 

841 + + ^ + + 900 

7V7in7i^ar!rrTnar,RAnTGAGTGTTACTCACCTTCT TCATGc TGATGGAAGTCAGGAACCCG 

KVYGYGAGNPL G _P_ A Q G I G F T 320 
AAGTACTACGGCTACGGCGCAGGCAACCCTCTGGGACCGGCTCAGGGGATAGGGTTCACC 

901 + + + -----I£lI+- ^^"^ 

TTCATGATGCCGATGCCGCGTCCGTTGGGAGACCCTGGCCGAGTCCCCTATCCCAAGTGG 

NELIARLTRSPVQDHTSTNS 340 
AACGAGCTGATTGCCCGGTTGACgCGTTCGCCAGTGCAGGACCACACCAGCACTAACTCG 

+ + + ^ + 

TTGCTCGACTAACGGGCCAACTGCGCAAGCGGTCACGTCCTGGTGTGGTCGTGATTGAGC 

TLVSNPATFPL N_A_T M Y V D F S 360 
ACTCTAGTCTCCAACCCGGCCACCTTCCCGTTGAACGCTACCATGTACGTCGACTTTTCA 

+ + + 1080 

1021 + + + --- ^f- 

TGAGATCAGAGGTTGGGCCGGTGG^iAGGGCAACTTGCGATGGTACATGCAGCTGAAAAGT 

HDNSMVSIFFA L_G _^ L Y N G T E P 380 
CACGACAACAGCATGGTTTCCATCTTCTTTGCATTGGGCCTGTACAACGGCACTGT^CCC 

1081 -H * + ^ 

GTGCTGTTGTCGTACCAAAGGTAGAAGAAACGTAACCTOGACATGTTGCCGTGACTTGGG 

LSRTSVESAKE L D G Y S A S W V 400 
TTGTCCCGGACCTCGGTGGAAAGCGCCAAGGAATTGGATGGGTATTCTGCATCCTGGGTG 

1141 ^ + * ^ 

AACAGGGCCTGGAGCCACCTTTCGCGGTTCCTTA^CTACCCATAAGACGTAGGACCCAC 

VPFGARAYFET M Q C K S E K E P 420 
GTGCCTTTCGGCGCGCGAGCCTACTTCGAGACGATGCAATGCAAGTCGGAAAAGGAGCCT 

1201 ^ 

CACGGAAAGCCGCGCGCTCGGATGAAGCTCTGCTACGTTACGTTCAGCCTTTTCCTCGGA 

LVRALINDRVV P_Ii„,H G C D V D K 440 
CTTGTTCGCGCTTTGATTAATGACCGGGTTGTGCCACTCCATGGCTGCGATGTGGACAAG 

1261 ^ + ^-—SZZZ^'^^. " 

GAACAAGCGCGAAACTAATTACTGGCCCAACACGGTGACGTACCGACGCTACACCTGTTC 
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IiGRCKLNDFVKGLSWARSG G 
CTGGGGCGATGCAAGCTGAATGACTTTGTCAAGGGArrGAGTTGGGCCAGATCTGGGGGC 

1321 + + + -- + + ■*■ 

GACCCCGCTACGTTCGACTTACTGAAACAGTTCCCTJ?ACTCAACCCGGTCTAG^^ 

NWGECFS* 467 
AACTGGGGAGAGTGCTTTAGTTGA . 

1381 + + 1404 

TTGACCCCTCTCACGAAATCAACT 
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Figure 10 

CP-l 

ECO RIMGVFVVLLSIATLFGST 
TATATGAATrCAIfiGGCGTGTTCOTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

^ ^ + + + + + 60 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTAIiGPRGNSHSCDTVDGG 
CATCCGGTACCOCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

ex . ^ . 120 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP- 3 

YQCFPEISHLWGQYSPYFSL 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGQGTCAATACTCTCCATACTTCTCTT 

121 . + ^ "0 

CAATGGTTACAAAGGGTCTTTAAAGAOTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 

EDESAISPDVP DDCRV T F V Q 
TGGAAGACGAATCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 

181 + + + ^ 240 

ACCTTCTGCTTAGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 

CP-4.7 

CP-5.7 

VLSRHGARYPTESKfiKl^YSA 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTgacTCTAAGggtAAGaagTACTCTG 

241 . + ^ ^ ^O^' 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGActgAGATTCccaTTCttcATGAGAC 

LIEAIQKNATAFKGKYAFLK 
CTCTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + ^ 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7 

TYNYTLGADDLTPFGENQMV 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 

361 + - — + ^ + * * 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

NSGIKFYRRYKALARKIVPF 
TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 + + + + ^ 480 

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-9,7 

CP-9 

IRASGSSRVIASAEKFIEGF 
TCATTAGAGCTTCTGGTTCTtCtAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 

481 . + — + - ^ 540 

AGTAATCTCGAAGACCAAGAagaTCTCAATAACQAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGSQPHQASPVID 
TCCAATCTGCTAAQTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG 

541 + + + 



AGG' 



-TTAGACGATTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAAC 
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VI ISEASSYNMTLD£GTCTA 
ACGTTATTATTtctGAcgctTCTtctTACAACAACACTTTGGACccaGGTACTTGTACTG 

601 + + + + + + 660 

TGCAATAATAAagaCTgcgaAGGagaJ^TGTTGTTGTGAAACCTGggtCCATGAACATGAC 

FEDSEL^DTVEANFTALFAP 
CTTTCGAAGACTCTGAATTGgctGACactGTTGAAGCTAACTTCACTGCTTTGTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTGAGACTTAACcgaCTGtgaCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP"12i7 

AIRARLEADLPGVTLTDTEV 
CAGCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACactgaaG 

721 + + + + + + 780 

GTCGATAATCTCGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGtgacttC 

CP-13.7 

lYLMDMCSFETVARTSDATE 
TTactTACTTGATGGACATGTGTtCtTTCGAAACTGTTGCTAGAACTTCTGACGCTACTG 

781 + + + + + + 840 

AAtgaATGAACTACCTGTACACAagcJU^GCTTTGACAACGATCTTGAAGACTGCGATGAC 

LSPFCALFTHDEWRHYDYLQ 
AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGAcacTACGACTACTTGC 

841 + + + + + + 900 

TTAACAGAGGTAAGACACGAAACAAGTGAGTGCTGCTTACCTCTgtgATGCTGATGAACG 

CP-14.7 

CT-15.7 

SLISKYYGHGAGNPLGPXQGV 
AATCTTTGaagAAGTACTACGGTc ac GGTGCTGGTAACCCATTGGGTCC Aac t CAAGGTG 

901 + + + + + + 960 

TTAGAAACttcTTCATGATGCCAgtgCCACGACCATTGGGTAACCCAGGTtgaGTTCCAC 

GFANELIARLTRSPVQDHTS 
TTGGTTTCGCTAACGAATTGATTGCTAGATTGACTAGATCTCCAGTTCAAGACCACACTT 

961 + + + + + + 1020 

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 
CP-16 

CP-17.7 

TNHTLDSNPATFPLNATLYA 
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTOAACGCTACTTTGTACG 

1021 + + + + + + 1080 

OATGATTGGTGTGAAACCTOAGATTGGOTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNGXISIFFALGLYNG 
CTGACTTCTCTCACGACAACggtattATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

1081 + + + + + + 1140 

GACTGAAGAGAGTGCTGTTGccataaTAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18,7 

CP-19.7 

TAPLSTTSVESIEETDGYSS 
GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTOACGGTTACTCTt 

1141 + + + + + + 1200 

CATGACGAGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGAa 
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^WTVPFASRAYVEMMQCQAE 
c tgc tTGGACTGTTCCATTCgc t tc tAGAGCTTACGTTGAAATQATGCAATGTCAAGCTG 

1201 + + + + + + ^260 

gacgaACCTGACAAGGTAAGcgaagaTCTCGAATGCAACTTTACTACGTTACAGTTCGAC 

CP-20 

CP-21 

KEPLVRVLVNDRVVPLHGCA 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 

1261 + + + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 

VDKLGRCKRDDFVEGIiSFAR 
CTGTTGACAAOTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTOTCTTTCGCTA 

1321 + + ^ + — - + + 1380 

GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22 

SGGNWAECFA* ECO RI 
GATCTGGTGGTAACTGGGCTGAATGTTTCGCTTAAGAATTCATATA 

1381 + + + 1426 

CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 
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Figure 12 
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Figure 13 
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Fignye 16 

84.72 
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Figure 17 
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Figure Ig 
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Fiaur-e 21 



1 MGVFWLLSI ATLFGSTSGT AIiGPRGNSHS CDTVDGGYQC FPEISSHWSE 

51 YSPXFSLADE SAISPDVPKG CRVTFVQVIjQ RHGAREPTSfi AAIRISAIilE 

101 AIQKNATAFK GKYAFLKTYN YTLGADDLiyP FGMQSSQAG IKFYRRYKAL 

151 ARKIVPFIRA SGSDRVIESA mHIEGFQSA KIADPGANPH QASPVINVII 

201 PEGAGYNNTL DHGLCTAFEE SELGDDVEAN FTAVFAPPIR ARLEAHLPGV 

251 NLTDEDWNL MDMCPFDTVA RTSDATfiLSP FCDLFTHDEW IQYDYLimLE 

3 01 KYYGJGAGNP LGPAQGVGFV NELIARLTHS PVQDHTSTNH TLDSNPATFP 

3 51 LNATLYADFS HDNTMVAIFF ALGLYNGTKP LSTTSVESIE ETDGYSASWi 

4 01 VPFSARMYVE MMQCEAEKEP LVRVLVNDRV VPLHGCGVDK LGRCKRDDFV 
4 51 EGLSFARSGG NWEECFA 
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